Perception-distortion trade-off in the SR space spanned by flowmodels
dc.contributor.authorid | 0000-0003-1465-8121 | |
dc.contributor.authorid | N/A | |
dc.contributor.authorid | 0000-0002-5078-4590 | |
dc.contributor.authorid | 0000-0002-6280-8422 | |
dc.contributor.coauthor | Erdem, Erkut | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuauthor | Korkmaz, Cansu | |
dc.contributor.kuauthor | Doğan, Zafer | |
dc.contributor.kuauthor | Erdem, Aykut | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.researchcenter | Koç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI) | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | 26207 | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 280658 | |
dc.contributor.yokid | 20331 | |
dc.date.accessioned | 2025-01-19T10:32:50Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (t) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photorealistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion trade-off compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | Green Submitted | |
dc.description.publisherscope | International | |
dc.description.sponsors | This work was supported in part by an AI Fellowship to C. Korkmaz provided by the KUIS AI Center. This work was supported in part by TUBITAK 2247-A Award No. 120C156, TUBITAK 2232 Award No. 118C337, and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges support from Turkish Academy of Sciences (TUBA), and AE acknowledges BAGEP Award of the Science Academy. | |
dc.identifier.doi | 10.1109/ICIP46576.2022.9897761 | |
dc.identifier.isbn | 978-1-6654-9620-9 | |
dc.identifier.issn | 1522-4880 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85146649275 | |
dc.identifier.uri | https://doi.org/10.1109/ICIP46576.2022.9897761 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/26470 | |
dc.identifier.wos | 1058109502098 | |
dc.keywords | Normalizing flows | |
dc.keywords | Super-resolution | |
dc.keywords | Image ensembles | |
dc.keywords | Image fusion | |
dc.keywords | Perception-distortion trade-off | |
dc.language | en | |
dc.publisher | IEEE | |
dc.relation.grantno | AI Fellowship by the KUIS AI Center; TUBITAK 2247-A Award [120C156]; TUBITAK 2232 Award [118C337]; KUIS AI Center - Turkish Is Bank; Turkish Academy of Sciences (TUBA); BAGEP Award of the Science Academy | |
dc.source | 2022 IEEE International Conference on Image Processing, ICIP | |
dc.subject | Electrical and electronics engineering | |
dc.subject | Computer engineering | |
dc.title | Perception-distortion trade-off in the SR space spanned by flowmodels | |
dc.type | Conference proceeding |