Publication:
A new multi-picture architecture for learned video deinterlacing and demosaicing with parallel deformable convolution and self-attention blocks

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorJi, Ronglei
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2025-01-19T10:31:50Z
dc.date.issued2024
dc.description.abstractDespite the fact real-world video deinterlacing and demosaicing are well-suited to supervised learning from synthetically degraded data because the degradation models are known and fixed, learned video deinterlacing and demosaicing have received much less attention compared to denoising and super-resolution tasks. We propose a new multi-picture architecture for video deinterlacing or demosaicing by aligning multiple supporting pictures with missing data to a reference picture to be reconstructed, benefiting from both local and global spatio-temporal correlations in the feature space using modified deformable convolution blocks and a novel residual efficient top-k self-attention (kSA) block, respectively. Separate reconstruction blocks are used to estimate different types of missing data. Our extensive experimental results, on synthetic or real-world datasets, demonstrate that the proposed novel architecture provides superior results that significantly exceed the state-of-the-art for both tasks in terms of PSNR, SSIM, and perceptual quality. Ablation studies are provided to justify and show the benefit of each novel modification made to the deformable convolution and residual efficient kSA blocks. Code is available: https://github.com/KUIS-AI-Tekalp-Research-Group/Video-Deinterlacing. © 2023
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipThis work is supported by TUBITAK 2247-A Award No. 120C156 and KUIS AI Center funded by Turkish Is Bank. AMT acknowledges Turkish Academy of Sciences. RJ acknowledges support from Fung Foundation.
dc.description.volume146
dc.identifier.doi10.1016/j.imavis.2024.105023
dc.identifier.eissn1872-8138
dc.identifier.issn2628856
dc.identifier.quartileQ1
dc.identifier.scopus2-s2.0-85190533371
dc.identifier.urihttps://doi.org/10.1016/j.imavis.2024.105023
dc.identifier.urihttps://hdl.handle.net/20.500.14288/26302
dc.identifier.wos1232237000001
dc.keywordsDeep learning
dc.keywordsDeinterlacing
dc.keywordsDemosaicing
dc.keywordsEfficient self-attention
dc.keywordsModified deformable convolution
dc.language.isoeng
dc.publisherElsevier Ltd
dc.relation.grantnoKUIS; Turkish Is Bank; Fung Foundation; Türkiye Bilimler Akademisi; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (120C156)
dc.relation.ispartofImage and Vision Computing
dc.subjectElectrical and electronics engineering
dc.titleA new multi-picture architecture for learned video deinterlacing and demosaicing with parallel deformable convolution and self-attention blocks
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.kuauthorTekalp, Ahmet Murat
local.contributor.kuauthorJi, Ronglei
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR04823.pdf
Size:
1.63 MB
Format:
Adobe Portable Document Format