Publication:
Omnidirectional image quality assessment with local-global vision transformers

dc.contributor.coauthorElfkir, Mohamed Hedi
dc.contributor.coauthorImamoglu, Nevrez
dc.contributor.coauthorOzcinar, Cagri
dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorTofighi, Nafiseh Jabbari
dc.contributor.kuauthorErdem, Aykut
dc.contributor.researchcenterKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-12-29T09:37:55Z
dc.date.issued2024
dc.description.abstractWith the rising popularity of omnidirectional images (ODIs) in virtual reality applications, the need for specialized image quality assessment (IQA) methods becomes increasingly critical. Traditional IQA approaches, designed for rectilinear images, often fail to evaluate ODIs accurately due to their 360 -degree scene representation. Addressing this, we introduce the Local - Global Transformer for 360 -degree Image Quality Assessment (LGT360IQ). This novel framework features dual branches tailored to mimic top -down and bottom -up visual attention mechanisms, adapted for the spherical characteristics of ODIs. The local branch processes tangent viewports from salient regions within the equirectangular projection image, extracting detailed features for granular quality assessment. In parallel, the global branch utilizes a task -dependent token sampling strategy for holistic image feature processing and quality score prediction. This integrated approach combines local and global information, offering an effective IQA method for ODIs. Our extensive evaluation across three benchmark ODI datasets, CVIQ, OIQA, and ODI, demonstrates LGT360IQ superior performance and establishes its role in advancing the field of IQA for omnidirectional images.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorsThis work was supported in part by KUIS AI Center Research Award to N. Jabbari Tofighi, TUBITAK-1001 Program Award No. 120E501, and BAGEP 2021 Award of the Science Academy to A. Erdem. During the preparation of this work the author (s) used GPT-4 in order to improve language and readability. After using this tool/service, the author (s) reviewed and edited the content as needed and take (s) full responsibility for the content of the publication.
dc.description.volume148
dc.identifier.doi10.1016/j.imavis.2024.105151
dc.identifier.eissn1872-8138
dc.identifier.issn0262-8856
dc.identifier.quartileQ1
dc.identifier.scopus2-s2.0-85196951314
dc.identifier.urihttps://doi.org/10.1016/j.imavis.2024.105151
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22515
dc.identifier.wos1262657000001
dc.keywords360-degree images
dc.keywordsImage quality assessment
dc.keywordsVision transformers
dc.languageen
dc.publisherElsevier
dc.sourceImage and Vision Computing
dc.subjectComputer science, artificial intelligence
dc.subjectComputer science, software engineering
dc.subjectComputer science, theory and methods
dc.subjectEngineering, electrical and electronic
dc.subjectOptics
dc.titleOmnidirectional image quality assessment with local-global vision transformers
dc.typeJournal article
dspace.entity.typePublication
local.contributor.kuauthorTofighi, Nafiseh Jabbari
local.contributor.kuauthorErdem, Aykut
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files