Publication: End-to-end rate-distortion optimization for bi-directional learned video compression
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Yılmaz, Melih | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 26207 | |
dc.date.accessioned | 2024-11-09T13:47:24Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Conventional video compression methods employ a linear transform and a block motion model, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned video compression allows end-to-end rate-distortion optimized training of all nonlinear modules, the quantization parameter, and the entropy model simultaneously. While previous work on learned video compression considered training a sequential video codec based on end-to-end optimization of cost averaged over pairs of successive frames, it is well known in conventional video compression that hierarchical, bi-directional coding outperforms sequential compression. In this paper, we propose for the first time end-to-end optimization of a hierarchical, bi-directional motion compensated learned codec by accumulating the cost function over fixed-size groups of pictures (GOP). Experimental results show that our proposed learned bi-directional GOP coder outperforms the state-of-the-art end-to-end optimized learned sequential compression in rate-distortion performance, as expected. | |
dc.description.fulltext | YES | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TÜBİTAK) | |
dc.description.sponsorship | Turkish Academy of Sciences (TUBA) | |
dc.description.version | Author's final manuscript | |
dc.format | | |
dc.identifier.doi | 10.1109/ICIP40778.2020.9190881 | |
dc.identifier.eissn | 2381-8549 | |
dc.identifier.embargo | NO | |
dc.identifier.filenameinventoryno | IR02689 | |
dc.identifier.isbn | 9781728163956 | |
dc.identifier.issn | 1522-4880 | |
dc.identifier.link | https://doi.org/10.1109/ICIP40778.2020.9190881 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85098622360 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/3765 | |
dc.keywords | Bi-directional motion compensation | |
dc.keywords | Deep learning | |
dc.keywords | End-to-end optimization | |
dc.keywords | Group of pictures | |
dc.keywords | Video compression | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.grantno | 2.17E+35 | |
dc.relation.uri | http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/9335 | |
dc.source | 2020 IEEE International Conference on Image Processing (ICIP) | |
dc.subject | Image compression | |
dc.title | End-to-end rate-distortion optimization for bi-directional learned video compression | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.kuauthor | Yılmaz, Melih | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |