Audiovisual synchronization and fusion using canonical correlation analysis

Publication:
Audiovisual synchronization and fusion using canonical correlation analysis

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Sargın, Mehmet Emre
dc.contributor.kuauthor	Tekalp, Ahmet Murat
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T12:29:11Z
dc.date.issued	2007
dc.description.abstract	It is well-known that early integration (also called data fusion) is effective when the modalities are correlated, and late integration (also called decision or opinion fusion) is optimal when modalities are uncorrelated. In this paper, we propose a new multimodal fusion strategy for open-set speaker identification using a combination of early and late integration following canonical correlation analysis (CCA) of speech and lip texture features. We also propose a method for high precision synchronization of the speech and lip features using CCA prior to the proposed fusion. Experimental results show that i) the proposed fusion strategy yields the best equal error rates (EER), which are used to quantify the performance of the fusion strategy for open-set speaker identification, and ii) precise synchronization prior to fusion improves the EER; hence, the best EER is obtained when the proposed synchronization scheme is employed together with the proposed fusion strategy. We note that the proposed fusion strategy outperforms others because the features used in the late integration are truly uncorrelated, since they are output of the CCA analysis.
dc.description.fulltext	YES
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.issue	7
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	EU
dc.description.sponsorship	European FP6 Network of Excellence SIMILAR
dc.description.version	Author's final manuscript
dc.description.volume	9
dc.identifier.doi	10.1109/TMM.2007.906583
dc.identifier.embargo	NO
dc.identifier.filenameinventoryno	IR01073
dc.identifier.issn	1520-9210
dc.identifier.quartile	Q1
dc.identifier.scopus	2-s2.0-57549101447
dc.identifier.uri	https://doi.org/10.1109/TMM.2007.906583
dc.identifier.wos	250447400006
dc.keywords	Information systems
dc.keywords	Software engineering
dc.keywords	Telecommunications
dc.keywords	Audiovisual synchronization
dc.keywords	Correlation
dc.keywords	Multimodal
dc.keywords	Fusion
dc.keywords	Speaker recognition
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof	IEEE Transactions on Multimedia
dc.relation.uri	http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/6100
dc.subject	Computer science
dc.title	Audiovisual synchronization and fusion using canonical correlation analysis
dc.type	Journal Article
dspace.entity.type	Publication
local.contributor.kuauthor	Sargın, Mehmet Emre
local.contributor.kuauthor	Yemez, Yücel
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Tekalp, Ahmet Murat
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Department of Electrical and Electronics Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 6100.pdf
Size:: 262.72 KB
Format:: Adobe Portable Document Format

Download

Collections

Publications with Fulltext

Publication: Audiovisual synchronization and fusion using canonical correlation analysis

Files

Original bundle

Collections

Publication:
Audiovisual synchronization and fusion using canonical correlation analysis