Publication:
Effect of architectures and training methods on the performance of learned video frame prediction

dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.kuauthorYılmaz, Mustafa Akın
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuprofilePhD Student
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of Electrical and Electronics Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokidN/A
dc.contributor.yokid26207
dc.date.accessioned2024-11-09T23:14:39Z
dc.date.issued2019
dc.description.abstractWe analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), A convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. the CRNN can be trained stably and very efficiently using the stateful truncated backpropagation through time procedure, and it requires an order of magnitude less inference runtime to achieve near real-time frame prediction with an acceptable performance.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipTUBITAK[217E033]
dc.description.sponsorshipTurkish academy of Sciences (TUBa) This work was supported by TUBITAKproject 217E033. a. Murat Tekalp also acknowledges support from Turkish academy of Sciences (TUBa).
dc.identifier.doiN/A
dc.identifier.isbn978-1-5386-6249-6
dc.identifier.issn1522-4880
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85076821510
dc.identifier.urihttps://hdl.handle.net/20.500.14288/10180
dc.identifier.wos521828604061
dc.keywordsFrame prediction
dc.keywordsDeep learning
dc.keywordsRecurrent neural networks
dc.keywordsStateful training
dc.keywordsConvolutional neural networks
dc.languageEnglish
dc.publisherIEEE
dc.source2019 IEEE international Conference on Image Processing (Icip)
dc.subjectDiagnostic imaging
dc.subjectPhotography
dc.titleEffect of architectures and training methods on the performance of learned video frame prediction
dc.typeConference proceeding
dspace.entity.typePublication
local.contributor.authorid0000-0002-0795-8970
local.contributor.authorid0000-0003-1465-8121
local.contributor.kuauthorYılmaz, Mustafa Akın
local.contributor.kuauthorTekalp, Ahmet Murat
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0

Files