Effect of architectures and training methods on the performance of learned video frame prediction

Publication:
Effect of architectures and training methods on the performance of learned video frame prediction

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Tekalp, Ahmet Murat
dc.contributor.kuauthor	Yılmaz, Mustafa Akın
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T23:14:39Z
dc.date.issued	2019
dc.description.abstract	We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), A convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. the CRNN can be trained stably and very efficiently using the stateful truncated backpropagation through time procedure, and it requires an order of magnitude less inference runtime to achieve near real-time frame prediction with an acceptable performance.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	TÜBİTAK
dc.description.sponsorship	TUBITAK[217E033]
dc.description.sponsorship	Turkish academy of Sciences (TUBa) This work was supported by TUBITAKproject 217E033. a. Murat Tekalp also acknowledges support from Turkish academy of Sciences (TUBa).
dc.identifier.isbn	978-1-5386-6249-6
dc.identifier.issn	1522-4880
dc.identifier.quartile	N/A
dc.identifier.scopus	2-s2.0-85076821510
dc.identifier.uri	https://hdl.handle.net/20.500.14288/10180
dc.identifier.wos	521828604061
dc.keywords	Frame prediction
dc.keywords	Deep learning
dc.keywords	Recurrent neural networks
dc.keywords	Stateful training
dc.keywords	Convolutional neural networks
dc.language.iso	eng
dc.publisher	IEEE
dc.relation.ispartof	2019 IEEE international Conference on Image Processing (Icip)
dc.subject	Diagnostic imaging
dc.subject	Photography
dc.title	Effect of architectures and training methods on the performance of learned video frame prediction
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Yılmaz, Mustafa Akın
local.contributor.kuauthor	Tekalp, Ahmet Murat
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Electrical and Electronics Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext

Publication: Effect of architectures and training methods on the performance of learned video frame prediction

Files

Collections

Publication:
Effect of architectures and training methods on the performance of learned video frame prediction