Perception-distortion trade-off in the SR space spanned by flowmodels

dc.contributor.authorid0000-0003-1465-8121
dc.contributor.authoridN/A
dc.contributor.authorid0000-0002-5078-4590
dc.contributor.authorid0000-0002-6280-8422
dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuauthorKorkmaz, Cansu
dc.contributor.kuauthorDoğan, Zafer
dc.contributor.kuauthorErdem, Aykut
dc.contributor.kuprofileFaculty Member
dc.contributor.kuprofilePhD Student
dc.contributor.kuprofileFaculty Member
dc.contributor.kuprofileFaculty Member
dc.contributor.researchcenterKoç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI)
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokid26207
dc.contributor.yokidN/A
dc.contributor.yokid280658
dc.contributor.yokid20331
dc.date.accessioned2025-01-19T10:32:50Z
dc.date.issued2022
dc.description.abstractFlow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (t) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photorealistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion trade-off compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessGreen Submitted
dc.description.publisherscopeInternational
dc.description.sponsorsThis work was supported in part by an AI Fellowship to C. Korkmaz provided by the KUIS AI Center. This work was supported in part by TUBITAK 2247-A Award No. 120C156, TUBITAK 2232 Award No. 118C337, and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges support from Turkish Academy of Sciences (TUBA), and AE acknowledges BAGEP Award of the Science Academy.
dc.identifier.doi10.1109/ICIP46576.2022.9897761
dc.identifier.isbn978-1-6654-9620-9
dc.identifier.issn1522-4880
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85146649275
dc.identifier.urihttps://doi.org/10.1109/ICIP46576.2022.9897761
dc.identifier.urihttps://hdl.handle.net/20.500.14288/26470
dc.identifier.wos1058109502098
dc.keywordsNormalizing flows
dc.keywordsSuper-resolution
dc.keywordsImage ensembles
dc.keywordsImage fusion
dc.keywordsPerception-distortion trade-off
dc.languageen
dc.publisherIEEE
dc.relation.grantnoAI Fellowship by the KUIS AI Center; TUBITAK 2247-A Award [120C156]; TUBITAK 2232 Award [118C337]; KUIS AI Center - Turkish Is Bank; Turkish Academy of Sciences (TUBA); BAGEP Award of the Science Academy
dc.source2022 IEEE International Conference on Image Processing, ICIP
dc.subjectElectrical and electronics engineering
dc.subjectComputer engineering
dc.titlePerception-distortion trade-off in the SR space spanned by flowmodels
dc.typeConference proceeding

Files