Publication:
ST360IQ: no-reference omnidirectional image quality assessment with spherical vision transformers

dc.contributor.coauthorHedi Elfkir, Mohamed
dc.contributor.coauthorİmamoğlu, Nevrez
dc.contributor.coauthorÖzçınar, Çağrı
dc.contributor.coauthorErdem, Erkut
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.kuauthorErdem, Aykut
dc.contributor.kuauthorTofighi, Nafiseh Jabbari
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2024-12-29T09:37:43Z
dc.date.issued2023
dc.description.abstractOmnidirectional images, aka 360° images, can deliver immersive and interactive visual experiences. As their popularity has increased dramatically in recent years, evaluating the quality of 360° images has become a problem of interest since it provides insights for capturing, transmitting, and consuming this new media. However, directly adapting quality assessment methods proposed for standard natural images for omnidirectional data poses certain challenges. These models need to deal with very high-resolution data and implicit distortions due to the spherical form of the images. In this study, we present a method for no-reference 360° image quality assessment. Our proposed ST360IQ model extracts tangent viewports from the salient parts of the input omnidirectional image and employs a vision-transformers based module processing saliency selective patches/tokens that estimates a quality score from each viewport. Then, it aggregates these scores to give a final quality score. Our experiments on two benchmark datasets, namely OIQA and CVIQ datasets, demonstrate that as compared to the state-of-the-art, our approach predicts the quality of an omnidirectional image correlated with the human-perceived image quality. The code has been available on https://github.com/Nafiseh-Tofighi/ST360IQ © 2023 IEEE.
dc.description.indexedbyScopus
dc.description.openaccessAll Open Access
dc.description.openaccessGreen Open Access
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorship 
dc.identifier.doi10.1109/ICASSP49357.2023.10096750
dc.identifier.eissn 
dc.identifier.issn1520-6149
dc.identifier.link 
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85177069079
dc.identifier.urihttps://doi.org/10.1109/ICASSP49357.2023.10096750
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22464
dc.keywords360° image quality assessment
dc.keywordsVision transformers
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.grantno 
dc.relation.ispartofICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
dc.rights 
dc.subjectImage quality assessment
dc.subjectReference image
dc.subjectQuality of service
dc.titleST360IQ: no-reference omnidirectional image quality assessment with spherical vision transformers
dc.typeConference Proceeding
dc.type.other 
dspace.entity.typePublication
local.contributor.kuauthorTofighi, Nafiseh Jabbari
local.contributor.kuauthorErdem, Aykut
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1College of Engineering
local.publication.orgunit1Research Center
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR05123.pdf
Size:
433.67 KB
Format:
Adobe Portable Document Format