Publication: End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression
dc.contributor.department | N/A | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Yılmaz, Mustafa Akın | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 26207 | |
dc.date.accessioned | 2024-11-09T23:00:07Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Conventional video compression methods employ a linear transform and block motion model, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to combinatorial nature of the end-to-end optimization problem. Learned video compression allows end-to-end rate-distortion optimized training of all nonlinear modules, quantization parameter and entropy model simultaneously. While previous work on learned video compression considered training a sequential video codec based on end-to-end optimization of cost averaged over pairs of successive frames, it is well-known in conventional video compression that hierarchical, bi-directional coding outperforms sequential compression. In this paper, we propose for the first time end-to-end optimization of a hierarchical, bi-directional motion compensated learned codec by accumulating cost function over fixed-size groups of pictures (GOP). Experimental results show that the rate-distortion performance of our proposed learned bi-directional GOP coder outperforms the state-of-the-art end-to-end optimized learned sequential compression as expected. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.indexedby | PubMed | |
dc.description.openaccess | YES | |
dc.description.sponsorship | TUBITAK project [217E033] | |
dc.description.sponsorship | Turkish Academy of Sciences (TUBA) This work was supported by TUBITAK project 217E033. A. Murat Tekalp also acknowledges support from Turkish Academy of Sciences (TUBA). | |
dc.identifier.doi | N/A | |
dc.identifier.isbn | 978-1-7281-6395-6 | |
dc.identifier.issn | 1522-4880 | |
dc.identifier.scopus | 2-s2.0-85098622360 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/8005 | |
dc.identifier.wos | 646178501083 | |
dc.keywords | Video compression | |
dc.keywords | Deep learning | |
dc.keywords | Bi-directional motion compensation | |
dc.keywords | Group of pictures | |
dc.keywords | End-to-end optimization | |
dc.language | English | |
dc.publisher | Ieee | |
dc.source | 2020 Ieee International Conference On Image Processing (Icip) | |
dc.subject | Diagnostic imaging | |
dc.subject | Photography | |
dc.title | End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-0795-8970 | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.kuauthor | Yılmaz, Mustafa Akın | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |