Publication:
Flexible luma-chroma bit allocation in learned image compression for high-fidelity sharper images

dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.kuauthorUlaş, Ökkeş Uğur
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuprofileMaster student
dc.contributor.kuprofileFaculty Member
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokidN/A
dc.contributor.yokid26207
dc.date.accessioned2024-11-09T23:28:39Z
dc.date.issued2022
dc.description.abstractHigh-fidelity learned image/video compression solutions are typically optimized with respect to l1 or l2 loss in RGB 444 format and evaluated by RGB PSNR. It is well-known that optimization of a fidelity criterion results in blurry images, which is typically alleviated by adding a content-based and/or adversarial loss terms. However, such conditional generative models result in loss of fidelity. In this paper, we propose a simple solution to obtain sharper images without losing fidelity based on learned flexible-rate coding using gained variational auto-encoder (gained-VAE) in the luma-chroma (YCrCb 444) domain. This allows us to implement image-adaptive luma-chroma bit allocation during inference, i.e., to increase Y PSNR at the expense of slightly lower chroma PSNR to obtain sharper images without introducing color artifacts based on the observation that Y PSNR correlates with image sharpness better than RGB PSNR. We note that the proposed inference-time image-adaptive luma-chroma bit allocation strategy can be incorporated into any VAE-based image compression model. Experimental results show that sharper images with better VMAF and Y PSNR can be obtained by optimizing models for YCrCb MSE with the proposed image-adaptive luma-chroma bit/quality allocation compared to stateof-the-art models optimizing RGB MSE at the same bpp.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessNO
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipTUBITAK 2247-A Award [120C156]
dc.description.sponsorshipKUIS AI Center - Turkish Is Bank
dc.description.sponsorshipTurkish Academy of Sciences (TUBA) This work is supported in part by TUBITAK 2247-A Award No. 120C156 and KUIS AI Center funded by Turkish Is Bank. A. M. Tekalp also acknowledges support from Turkish Academy of Sciences (TUBA).
dc.identifier.doi10.1109/PCS56426.2022.10017994
dc.identifier.eissn2472-7822
dc.identifier.isbn978-1-6654-9257-7
dc.identifier.issn2330-7935
dc.identifier.scopus2-s2.0-85147667450
dc.identifier.urihttp://dx.doi.org/10.1109/PCS56426.2022.10017994
dc.identifier.urihttps://hdl.handle.net/20.500.14288/11929
dc.identifier.wos926892300006
dc.keywordsGained variational autoencoder
dc.keywordsFlexible luma chroma bit allocation
dc.keywordsLuma PSNR
dc.keywordsImage sharpness
dc.languageEnglish
dc.publisherIEEE
dc.source2022 Picture Coding Symposium (Pcs)
dc.subjectElectrical electronic engineering
dc.subjectImaging system
dc.subjectNeural computers
dc.subjectNeural networks (Computer science)
dc.subjectImage processing
dc.subjectPhotography
dc.titleFlexible luma-chroma bit allocation in learned image compression for high-fidelity sharper images
dc.typeConference proceeding
dspace.entity.typePublication
local.contributor.authorid0000-0001-9817-2342
local.contributor.authorid0000-0003-1465-8121
local.contributor.kuauthorUlaş, Ökkeş Uğur
local.contributor.kuauthorTekalp, Ahmet Murat
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0

Files