Publication:
Role of audio in video summarization

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorShoer, İbrahim
dc.contributor.kuauthorKöprü, Berkay
dc.contributor.kuauthorErzin, Engin
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.researchcenterKoç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI)
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-12-29T09:36:02Z
dc.date.issued2023
dc.description.abstractVideo summarization attracts attention for efficient video representation, retrieval, and browsing to ease volume and traffic surge problems. Although video summarization mostly uses the visual channel for compaction, the benefits of audio-visual modeling appeared in recent literature. The information coming from the audio channel can be a result of audio-visual correlation in the video content. In this study, we propose a new audio-visual video summarization framework integrating four ways of audio-visual information fusion with GRU-based and attention-based networks. Furthermore, we investigate a new explainability methodology using audio-visual canonical correlation analysis (CCA) to better understand and explain the role of audio in the video summarization task. Experimental evaluations on the TVSum dataset attain F1 score and Kendall-tau score improvements for the audio-visual video summarization. Furthermore, splitting video content on TVSum and COGNIMUSE datasets based on audio-visual CCA as positively and negatively correlated videos yields a strong performance improvement over the positively correlated videos for audio-only and audio-visual video summarization.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.identifier.doi10.1109/ICASSPW59220.2023.10192578
dc.identifier.isbn979-8-3503-0261-5
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85168238207
dc.identifier.urihttps://doi.org/10.1109/ICASSPW59220.2023.10192578
dc.identifier.urihttps://hdl.handle.net/20.500.14288/21907
dc.identifier.wos1046933700001
dc.keywordsAudio-visual video summarization
dc.keywordsCanonical correlation analysis
dc.languageen
dc.publisherIEEE
dc.source2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW
dc.subjectAcoustics
dc.subjectComputer science
dc.subjectInterdisciplinary applications
dc.subjectElectrical engineering
dc.subjectElectronic engineering
dc.subjectImaging science and photographic technology
dc.titleRole of audio in video summarization
dc.typeConference proceeding
dspace.entity.typePublication
local.contributor.kuauthorShoer, İbrahim
local.contributor.kuauthorKöprü, Berkay
local.contributor.kuauthorErzin, Engin
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files