Publication: Multi-scale deformable alignment and content-adaptive inference for flexible-rate bi-directional video compression
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Yılmaz, Mustafa Akın | |
dc.contributor.kuauthor | Ulaş, Ökkeş Uğur | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-12-29T09:41:10Z | |
dc.date.issued | 2023 | |
dc.description.abstract | The lack of ability to adapt the motion compensation model to video content is an important limitation of current end-to-end learned video compression models. This paper advances the state-of-the-art by proposing an adaptive motion-compensation model for end-to-end rate-distortion optimized hierarchical bi-directional video compression. In particular, we propose two novelties: i) a multi-scale deformable alignment scheme at the feature level combined with multi-scale conditional coding, ii) motion-content adaptive inference. In addition, we employ a gain unit, which enables a single model to operate at multiple rate-distortion operating points. We also exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate state-of-the-art rate-distortion performance exceeding those of all prior art in learned video coding1. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsors | This work is supported in part by TUBITAK 2247-A Award No. 120C156 and KUIS AI Center funded by Turkish Is Bank. A. M. Tekalp also acknowledges support from Turkish Academy of Sciences (TUBA). | |
dc.identifier.doi | 10.1109/ICIP49359.2023.10223112 | |
dc.identifier.isbn | 978-172819835-4 | |
dc.identifier.issn | 1522-4880 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85180761136 | |
dc.identifier.uri | https://doi.org/10.1109/ICIP49359.2023.10223112 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/23560 | |
dc.identifier.wos | 1106821002113 | |
dc.keywords | Bi-directional video compression | |
dc.keywords | Content-adaptive inference | |
dc.keywords | End-to-end rate-distortion optimization | |
dc.keywords | Flexible-rate coding | |
dc.keywords | Hierarchical B pictures | |
dc.language | en | |
dc.publisher | IEEE Computer Society | |
dc.relation.grantno | KUIS | |
dc.relation.grantno | TUBA | |
dc.relation.grantno | Turkish Is Bank | |
dc.relation.grantno | Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (120C156) | |
dc.relation.grantno | Türkiye Bilimler Akademisi | |
dc.source | Proceedings - International Conference on Image Processing, ICIP | |
dc.subject | Computer science | |
dc.subject | Artificial intelligence | |
dc.subject | Theory | |
dc.subject | Methods | |
dc.title | Multi-scale deformable alignment and content-adaptive inference for flexible-rate bi-directional video compression | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Yılmaz, Mustafa Akın | |
local.contributor.kuauthor | Ulaş, Ökkeş Uğur | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |