Publication: Video frame prediction via deep learning
dc.contributor.department | N/A | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Yılmaz, Mustafa Akın | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 26207 | |
dc.date.accessioned | 2024-11-09T23:13:11Z | |
dc.date.issued | 2020 | |
dc.description.abstract | This paper provides new results over our previous work presented in ICIP 2019 on the performance of learned frame prediction architectures and associated training methods. More specifically, we show that using an end-to-end residual connection in the fully convolutional neural network (FCNN) provides improved performance. In order to provide comparative results, we trained a residual FCNN, a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next-frame prediction using the mean square loss. We performed both stateless and stateful training for the recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal-to-noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be stably and efficiently trained using the stateful truncated backpropagation through time procedure, and requires an order of magnitude less inference runtime to achieve an acceptable performance in near real time. | |
dc.description.indexedby | WoS | |
dc.description.openaccess | NO | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | TÜBİTAK project [217E033] | |
dc.description.sponsorship | Turkish Academy of Sciences (TÜBA). This work was supported by TÜBİTAK project 217E033. A. Murat Tekalp also acknowledges support from the Turkish Academy of Sciences (TÜBA). | |
dc.identifier.doi | N/A | |
dc.identifier.isbn | 978-1-7281-7206-4 | |
dc.identifier.issn | 2165-0608 | |
dc.identifier.quartile | N/A | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/9946 | |
dc.identifier.wos | 653136100021 | |
dc.keywords | Frame prediction | |
dc.keywords | Deep learning | |
dc.keywords | Recurrent network architectures | |
dc.keywords | Stateful training | |
dc.keywords | Convolutional network architectures | |
dc.language | Turkish | |
dc.publisher | IEEE | |
dc.source | 2020 28th Signal Processing and Communications Applications Conference (SIU) | |
dc.subject | Civil engineering | |
dc.subject | Electrical electronics engineering | |
dc.subject | Telecommunication | |
dc.title | Video frame prediction via deep learning | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-0795-8970 | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.kuauthor | Yılmaz, Mustafa Akın | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |