Publication:
ST360IQ: no-reference omnidirectional image quality assessment with spherical vision transformers

dc.contributor.coauthorHedi Elfkir, Mohamed
dc.contributor.coauthorİmamoğlu, Nevrez
dc.contributor.coauthorÖzçınar, Çağrı
dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorTofighi, Nafiseh Jabbari
dc.contributor.kuauthorErdem, Aykut
dc.contributor.researchcenterKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.unit 
dc.date.accessioned2024-12-29T09:37:43Z
dc.date.issued2023
dc.description.abstractOmnidirectional images, aka 360° images, can deliver immersive and interactive visual experiences. As their popularity has increased dramatically in recent years, evaluating the quality of 360° images has become a problem of interest since it provides insights for capturing, transmitting, and consuming this new media. However, directly adapting quality assessment methods proposed for standard natural images for omnidirectional data poses certain challenges. These models need to deal with very high-resolution data and implicit distortions due to the spherical form of the images. In this study, we present a method for no-reference 360° image quality assessment. Our proposed ST360IQ model extracts tangent viewports from the salient parts of the input omnidirectional image and employs a vision-transformers based module processing saliency selective patches/tokens that estimates a quality score from each viewport. Then, it aggregates these scores to give a final quality score. Our experiments on two benchmark datasets, namely OIQA and CVIQ datasets, demonstrate that as compared to the state-of-the-art, our approach predicts the quality of an omnidirectional image correlated with the human-perceived image quality. The code has been available on https://github.com/Nafiseh-Tofighi/ST360IQ © 2023 IEEE.
dc.description.indexedbyScopus
dc.description.openaccessAll Open Access
dc.description.openaccessGreen Open Access
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsors 
dc.identifier.doi10.1109/ICASSP49357.2023.10096750
dc.identifier.eissn 
dc.identifier.issn1520-6149
dc.identifier.link 
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85177069079
dc.identifier.urihttps://doi.org/10.1109/ICASSP49357.2023.10096750
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22464
dc.keywords360° image quality assessment
dc.keywordsVision transformers
dc.languageen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.grantno 
dc.rights 
dc.sourceICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
dc.subjectImage quality assessment
dc.subjectReference image
dc.subjectQuality of service
dc.titleST360IQ: no-reference omnidirectional image quality assessment with spherical vision transformers
dc.typeConference proceeding
dc.type.other 
dspace.entity.typePublication
local.contributor.kuauthorTofighi, Nafiseh Jabbari
local.contributor.kuauthorErdem, Aykut
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR05123.pdf
Size:
433.67 KB
Format:
Adobe Portable Document Format