Publication:
Perception-distortion trade-off in the SR space spanned by flow models

dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.kuauthorDoğan, Zafer
dc.contributor.kuauthorErdem, Aykut
dc.contributor.kuauthorKorkmaz, Cansu
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2024-11-10T00:05:54Z
dc.date.issued2022
dc.description.abstractFlow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (τ) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photo-realistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion tradeoff compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality. © 2022 IEEE.
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipThis work was supported in part by an AI Fellowship to C. Korkmaz provided by the KUIS AI Center. This work was supported in part by TUBITAK 2247-A Award No. 120C156, TUBITAK 2232 Award No. 118C337, and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges support from Turkish Academy of Sciences (TUBA), and AE acknowledges BAGEP Award of the Science Academy.
dc.identifier.doi10.1109/ICIP46576.2022.9897761
dc.identifier.isbn9781-6654-9620-9
dc.identifier.issn1522-4880
dc.identifier.scopus2-s2.0-85146649275
dc.identifier.urihttps://doi.org/10.1109/ICIP46576.2022.9897761
dc.identifier.urihttps://hdl.handle.net/20.500.14288/16524
dc.identifier.wos1058109502098
dc.keywordsImage ensembles
dc.keywordsImage fusion
dc.keywordsNormalizing flows
dc.keywordsPerception-distortion trade-off
dc.keywordsSuper-resolution Computer vision
dc.keywordsEconomic and social effects
dc.keywordsImage enhancement
dc.keywordsOptical resolving power
dc.keywordsTextures
dc.keywordsFlow modelling
dc.keywordsFusion strategies
dc.keywordsImage ensemble
dc.keywordsNormalizing flow
dc.keywordsPerception-distortion trade-off
dc.keywordsPerceptual quality
dc.keywordsResolution images
dc.keywordsSample solution
dc.keywordsSuperresolution
dc.keywordsTrade off
dc.keywordsImage fusion
dc.language.isoeng
dc.publisherThe Institute of Electrical and Electronics Engineers Signal Processing Society
dc.relation.ispartofProceedings - International Conference on Image Processing, ICIP
dc.subjectConvolutional neural network
dc.subjectHallucinations
dc.subjectSparse representation
dc.titlePerception-distortion trade-off in the SR space spanned by flow models
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorDoğan, Zafer
local.contributor.kuauthorTekalp, Ahmet Murat
local.contributor.kuauthorErdem, Aykut
local.contributor.kuauthorKorkmaz, Cansu
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1Research Center
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files