Publication: Perception-distortion trade-off in the SR space spanned by flow models
dc.contributor.coauthor | Erdem, Erkut | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Graduate School of Sciences and Engineering | |
dc.contributor.department | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
dc.contributor.kuauthor | Doğan, Zafer | |
dc.contributor.kuauthor | Erdem, Aykut | |
dc.contributor.kuauthor | Korkmaz, Cansu | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
dc.contributor.schoolcollegeinstitute | Research Center | |
dc.date.accessioned | 2024-11-10T00:05:54Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (τ) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photo-realistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion tradeoff compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality. © 2022 IEEE. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | This work was supported in part by an AI Fellowship to C. Korkmaz provided by the KUIS AI Center. This work was supported in part by TUBITAK 2247-A Award No. 120C156, TUBITAK 2232 Award No. 118C337, and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges support from Turkish Academy of Sciences (TUBA), and AE acknowledges BAGEP Award of the Science Academy. | |
dc.identifier.doi | 10.1109/ICIP46576.2022.9897761 | |
dc.identifier.isbn | 9781-6654-9620-9 | |
dc.identifier.issn | 1522-4880 | |
dc.identifier.scopus | 2-s2.0-85146649275 | |
dc.identifier.uri | https://doi.org/10.1109/ICIP46576.2022.9897761 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/16524 | |
dc.identifier.wos | 1058109502098 | |
dc.keywords | Image ensembles | |
dc.keywords | Image fusion | |
dc.keywords | Normalizing flows | |
dc.keywords | Perception-distortion trade-off | |
dc.keywords | Super-resolution Computer vision | |
dc.keywords | Economic and social effects | |
dc.keywords | Image enhancement | |
dc.keywords | Optical resolving power | |
dc.keywords | Textures | |
dc.keywords | Flow modelling | |
dc.keywords | Fusion strategies | |
dc.keywords | Image ensemble | |
dc.keywords | Normalizing flow | |
dc.keywords | Perception-distortion trade-off | |
dc.keywords | Perceptual quality | |
dc.keywords | Resolution images | |
dc.keywords | Sample solution | |
dc.keywords | Superresolution | |
dc.keywords | Trade off | |
dc.keywords | Image fusion | |
dc.language.iso | eng | |
dc.publisher | The Institute of Electrical and Electronics Engineers Signal Processing Society | |
dc.relation.ispartof | Proceedings - International Conference on Image Processing, ICIP | |
dc.subject | Convolutional neural network | |
dc.subject | Hallucinations | |
dc.subject | Sparse representation | |
dc.title | Perception-distortion trade-off in the SR space spanned by flow models | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Doğan, Zafer | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
local.contributor.kuauthor | Erdem, Aykut | |
local.contributor.kuauthor | Korkmaz, Cansu | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit1 | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
local.publication.orgunit1 | Research Center | |
local.publication.orgunit2 | Department of Electrical and Electronics Engineering | |
local.publication.orgunit2 | Department of Computer Engineering | |
local.publication.orgunit2 | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
local.publication.orgunit2 | Graduate School of Sciences and Engineering | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
relation.isOrgUnitOfPublication | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication | 434c9663-2b11-4e66-9399-c863e2ebae43 | |
relation.isParentOrgUnitOfPublication | d437580f-9309-4ecb-864a-4af58309d287 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |