Publication:
Perception-distortion trade-off in the SR space spanned by flowmodels

dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorErdem, Aykut
dc.contributor.kuauthorDoğan, Zafer
dc.contributor.kuauthorKorkmaz, Cansu
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2025-01-19T10:32:50Z
dc.date.issued2022
dc.description.abstractFlow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (t) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photorealistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion trade-off compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessGreen Submitted
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipThis work was supported in part by an AI Fellowship to C. Korkmaz provided by the KUIS AI Center. This work was supported in part by TUBITAK 2247-A Award No. 120C156, TUBITAK 2232 Award No. 118C337, and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges support from Turkish Academy of Sciences (TUBA), and AE acknowledges BAGEP Award of the Science Academy.
dc.identifier.doi10.1109/ICIP46576.2022.9897761
dc.identifier.isbn978-1-6654-9620-9
dc.identifier.issn1522-4880
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85146649275
dc.identifier.urihttps://doi.org/10.1109/ICIP46576.2022.9897761
dc.identifier.urihttps://hdl.handle.net/20.500.14288/26470
dc.identifier.wos1058109502098
dc.keywordsNormalizing flows
dc.keywordsSuper-resolution
dc.keywordsImage ensembles
dc.keywordsImage fusion
dc.keywordsPerception-distortion trade-off
dc.language.isoeng
dc.publisherIEEE
dc.relation.grantnoAI Fellowship by the KUIS AI Center; TUBITAK 2247-A Award [120C156]; TUBITAK 2232 Award [118C337]; KUIS AI Center - Turkish Is Bank; Turkish Academy of Sciences (TUBA); BAGEP Award of the Science Academy
dc.relation.ispartof2022 IEEE International Conference on Image Processing, ICIP
dc.subjectElectrical and electronics engineering
dc.subjectComputer engineering
dc.titlePerception-distortion trade-off in the SR space spanned by flowmodels
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorTekalp, Ahmet Murat
local.contributor.kuauthorKorkmaz, Cansu
local.contributor.kuauthorDoğan, Zafer
local.contributor.kuauthorErdem, Aykut
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1Research Center
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR05112.pdf
Size:
14.59 MB
Format:
Adobe Portable Document Format