Research Outputs

Permanent URI for this communityhttps://hdl.handle.net/20.500.14288/2

Browse

Search Results

Now showing 1 - 2 of 2
  • Placeholder
    Publication
    Flexible luma-chroma bit allocation in learned image compression for high-fidelity sharper images
    (IEEE, 2022) N/A; Department of Electrical and Electronics Engineering; Ulaş, Ökkeş Uğur; Tekalp, Ahmet Murat; Master student; Faculty Member; Department of Electrical and Electronics Engineering; Graduate School of Sciences and Engineering; College of Engineering; N/A; 26207
    High-fidelity learned image/video compression solutions are typically optimized with respect to l1 or l2 loss in RGB 444 format and evaluated by RGB PSNR. It is well-known that optimization of a fidelity criterion results in blurry images, which is typically alleviated by adding a content-based and/or adversarial loss terms. However, such conditional generative models result in loss of fidelity. In this paper, we propose a simple solution to obtain sharper images without losing fidelity based on learned flexible-rate coding using gained variational auto-encoder (gained-VAE) in the luma-chroma (YCrCb 444) domain. This allows us to implement image-adaptive luma-chroma bit allocation during inference, i.e., to increase Y PSNR at the expense of slightly lower chroma PSNR to obtain sharper images without introducing color artifacts based on the observation that Y PSNR correlates with image sharpness better than RGB PSNR. We note that the proposed inference-time image-adaptive luma-chroma bit allocation strategy can be incorporated into any VAE-based image compression model. Experimental results show that sharper images with better VMAF and Y PSNR can be obtained by optimizing models for YCrCb MSE with the proposed image-adaptive luma-chroma bit/quality allocation compared to stateof-the-art models optimizing RGB MSE at the same bpp.
  • Placeholder
    Publication
    Flexible-rate learned hierarchical bi-directional video compression with motion refinement and frame-level bit allocation
    (IEEE Computer Society, 2022) Department of Electrical and Electronics Engineering; N/A; N/A; Tekalp, Ahmet Murat; Yılmaz, Mustafa Akın; Çetin, Eren; Faculty Member; PhD Student; Undergraduate Student; Department of Electrical and Electronics Engineering; College of Engineering; Graduate School of Sciences and Engineering; College of Engineering; 26207; N/A; N/A
    This paper presents improvements and novel additions to our recent work on end-to-end optimized hierarchical bidirectional video compression [1] to further advance the state-of-the-art in learned video compression. As an improvement, we combine motion estimation and prediction modules and compress refined residual motion vectors for improved rate-distortion performance. As novel addition, we adapted the gain unit proposed for image compression to flexible-rate video compression in two ways: first, the gain unit enables a single encoder model to operate at multiple rate-distortion operating points; second, we exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate that we obtain state-of-the-art rate-distortion performance exceeding those of all prior art in learned video coding.