Video frame prediction via deep learning

Publication:
Video frame prediction via deep learning

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.kuauthor	Tekalp, Ahmet Murat
dc.contributor.kuauthor	Yılmaz, Mustafa Akın
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.date.accessioned	2024-11-09T12:39:36Z
dc.date.issued	2020
dc.description.abstract	This paper provides new results over our previous work presented in ICIP 2019 on the performance of learned frame prediction architectures and associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) provides improved performance. In order to provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long-short term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be stably and efficiently trained using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve an acceptable performance in near real-time. / Bu bildiri, ICIP 2019’da sundugumuz öğrenilmiş video çerçeve öngörü mimarileri ve egitim yöntemleri karşılaştırılması çalışmamız üzerine yeni sonuçlar vermektedir. Daha spesifik olarak, tamamen evrisimsel sinir agında (FCNN) uçtan uca kalıntı bağlantısı kullanarak öngörü performansında gelişim sağlandığını gösteriyoruz. Sonuçları karşılaştırmak için, ortalama kare kaybını kullanarak sonraki kareyi öngörmek için kalıntı baglantılı FCNN, evrisimsel bir RNN (CRNN) ve evrisimsel uzun-kısa süreli bellek (CLSTM) agı eğittik. Yinelemeli ağlar için durum bilgisi taşımayan (stateless) ve taşıyan (stateful) egitim gerçekleştirdik. Deneysel sonuçlar, kalıntı baglantılı FCNN mimarisinin yüksek egitim ve test (çıkarım) hesaplama karmaşıklığı pahasına PSNR açısından en iyi performansı verdigini göstermektedir. CRNN, durum bilgisi taşıyan zamanda geri yayılım yoluyla kararlı olarak egitilebilmekte ve kabul edilebilir bir performansla gerçek-zamanlıya yakın çerçeve öngörüsü yapabilmektedir.
dc.description.fulltext	YES
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	TÜBİTAK
dc.description.sponsorship	Scientific and Technological Research Council of Turkey (TÜBİTAK)
dc.description.sponsorship	Turkish Academy of Sciences (TUBA)
dc.description.version	Author's final manuscript
dc.identifier.doi	10.1109/SIU49456.2020.9302047
dc.identifier.embargo	NO
dc.identifier.filenameinventoryno	IR03846
dc.identifier.isbn	978-1-7281-7206-4
dc.identifier.issn	2165-0608
dc.identifier.quartile	N/A
dc.identifier.scopus	2-s2.0-85100304312
dc.identifier.uri	https://doi.org/10.1109/SIU49456.2020.9302047
dc.identifier.wos	653136100021
dc.keywords	Frame prediction
dc.keywords	Deep learning
dc.keywords	Recurrent network architectures
dc.keywords	Stateful training
dc.keywords	Convolutional network architectures
dc.language.iso	tur
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.grantno	2.18E+37
dc.relation.ispartof	2020 28th Signal Processing and Communications Applications Conference (SIU)
dc.relation.uri	http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/10714
dc.subject	Electrical and electronic engineering
dc.subject	Telecommunications
dc.title	Video frame prediction via deep learning
dc.title.alternative	Derin ögrenme ile video çerçeve öngörüsü
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Tekalp, Ahmet Murat
local.contributor.kuauthor	Yılmaz, Mustafa Akın
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 10714.pdf
Size:: 414.74 KB
Format:: Adobe Portable Document Format

Download

Collections

Publications with Fulltext

Publication: Video frame prediction via deep learning

Files

Original bundle

Collections

Publication:
Video frame prediction via deep learning