Publication: Omnidirectional image quality assessment with local-global vision transformers
dc.contributor.coauthor | Elfkir, Mohamed Hedi | |
dc.contributor.coauthor | Imamoglu, Nevrez | |
dc.contributor.coauthor | Ozcinar, Cagri | |
dc.contributor.coauthor | Erdem, Erkut | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Tofighi, Nafiseh Jabbari | |
dc.contributor.kuauthor | Erdem, Aykut | |
dc.contributor.researchcenter | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-12-29T09:37:55Z | |
dc.date.issued | 2024 | |
dc.description.abstract | With the rising popularity of omnidirectional images (ODIs) in virtual reality applications, the need for specialized image quality assessment (IQA) methods becomes increasingly critical. Traditional IQA approaches, designed for rectilinear images, often fail to evaluate ODIs accurately due to their 360 -degree scene representation. Addressing this, we introduce the Local - Global Transformer for 360 -degree Image Quality Assessment (LGT360IQ). This novel framework features dual branches tailored to mimic top -down and bottom -up visual attention mechanisms, adapted for the spherical characteristics of ODIs. The local branch processes tangent viewports from salient regions within the equirectangular projection image, extracting detailed features for granular quality assessment. In parallel, the global branch utilizes a task -dependent token sampling strategy for holistic image feature processing and quality score prediction. This integrated approach combines local and global information, offering an effective IQA method for ODIs. Our extensive evaluation across three benchmark ODI datasets, CVIQ, OIQA, and ODI, demonstrates LGT360IQ superior performance and establishes its role in advancing the field of IQA for omnidirectional images. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsors | This work was supported in part by KUIS AI Center Research Award to N. Jabbari Tofighi, TUBITAK-1001 Program Award No. 120E501, and BAGEP 2021 Award of the Science Academy to A. Erdem. During the preparation of this work the author (s) used GPT-4 in order to improve language and readability. After using this tool/service, the author (s) reviewed and edited the content as needed and take (s) full responsibility for the content of the publication. | |
dc.description.volume | 148 | |
dc.identifier.doi | 10.1016/j.imavis.2024.105151 | |
dc.identifier.eissn | 1872-8138 | |
dc.identifier.issn | 0262-8856 | |
dc.identifier.quartile | Q1 | |
dc.identifier.scopus | 2-s2.0-85196951314 | |
dc.identifier.uri | https://doi.org/10.1016/j.imavis.2024.105151 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/22515 | |
dc.identifier.wos | 1262657000001 | |
dc.keywords | 360-degree images | |
dc.keywords | Image quality assessment | |
dc.keywords | Vision transformers | |
dc.language | en | |
dc.publisher | Elsevier | |
dc.source | Image and Vision Computing | |
dc.subject | Computer science, artificial intelligence | |
dc.subject | Computer science, software engineering | |
dc.subject | Computer science, theory and methods | |
dc.subject | Engineering, electrical and electronic | |
dc.subject | Optics | |
dc.title | Omnidirectional image quality assessment with local-global vision transformers | |
dc.type | Journal article | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Tofighi, Nafiseh Jabbari | |
local.contributor.kuauthor | Erdem, Aykut | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |