Publication: Flexible luma-chroma bit allocation in learned image compression for high-fidelity sharper images
dc.contributor.department | N/A | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Ulaş, Ökkeş Uğur | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuprofile | Master student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 26207 | |
dc.date.accessioned | 2024-11-09T23:28:39Z | |
dc.date.issued | 2022 | |
dc.description.abstract | High-fidelity learned image/video compression solutions are typically optimized with respect to l1 or l2 loss in RGB 444 format and evaluated by RGB PSNR. It is well-known that optimization of a fidelity criterion results in blurry images, which is typically alleviated by adding a content-based and/or adversarial loss terms. However, such conditional generative models result in loss of fidelity. In this paper, we propose a simple solution to obtain sharper images without losing fidelity based on learned flexible-rate coding using gained variational auto-encoder (gained-VAE) in the luma-chroma (YCrCb 444) domain. This allows us to implement image-adaptive luma-chroma bit allocation during inference, i.e., to increase Y PSNR at the expense of slightly lower chroma PSNR to obtain sharper images without introducing color artifacts based on the observation that Y PSNR correlates with image sharpness better than RGB PSNR. We note that the proposed inference-time image-adaptive luma-chroma bit allocation strategy can be incorporated into any VAE-based image compression model. Experimental results show that sharper images with better VMAF and Y PSNR can be obtained by optimizing models for YCrCb MSE with the proposed image-adaptive luma-chroma bit/quality allocation compared to stateof-the-art models optimizing RGB MSE at the same bpp. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | NO | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | TUBITAK 2247-A Award [120C156] | |
dc.description.sponsorship | KUIS AI Center - Turkish Is Bank | |
dc.description.sponsorship | Turkish Academy of Sciences (TUBA) This work is supported in part by TUBITAK 2247-A Award No. 120C156 and KUIS AI Center funded by Turkish Is Bank. A. M. Tekalp also acknowledges support from Turkish Academy of Sciences (TUBA). | |
dc.identifier.doi | 10.1109/PCS56426.2022.10017994 | |
dc.identifier.eissn | 2472-7822 | |
dc.identifier.isbn | 978-1-6654-9257-7 | |
dc.identifier.issn | 2330-7935 | |
dc.identifier.scopus | 2-s2.0-85147667450 | |
dc.identifier.uri | http://dx.doi.org/10.1109/PCS56426.2022.10017994 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/11929 | |
dc.identifier.wos | 926892300006 | |
dc.keywords | Gained variational autoencoder | |
dc.keywords | Flexible luma chroma bit allocation | |
dc.keywords | Luma PSNR | |
dc.keywords | Image sharpness | |
dc.language | English | |
dc.publisher | IEEE | |
dc.source | 2022 Picture Coding Symposium (Pcs) | |
dc.subject | Electrical electronic engineering | |
dc.subject | Imaging system | |
dc.subject | Neural computers | |
dc.subject | Neural networks (Computer science) | |
dc.subject | Image processing | |
dc.subject | Photography | |
dc.title | Flexible luma-chroma bit allocation in learned image compression for high-fidelity sharper images | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0001-9817-2342 | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.kuauthor | Ulaş, Ökkeş Uğur | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |