Multimodal speaker identification using canonical correlation analysis

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Faculty Member, Erzin, Engin
dc.contributor.kuauthor	Master Student, Sargın, Mehmet Emre
dc.contributor.kuauthor	Faculty Member, Tekalp, Ahmet Murat
dc.contributor.kuauthor	Faculty Member, Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T22:53:07Z
dc.date.issued	2006
dc.description.abstract	In this work, we explore the use of canonical correlation analysis to improve the performance of multimodal recognition systems that involve multiple correlated modalities. More specifically, we consider the audiovisual speaker identification problem, where speech and lip texture (or intensity) modalities are fused in an open-set identification framework. Our motivation is based on the following observation. The late integration strategy, also referred to as decision or opinion fusion, is most effective when the contributing modalities are uncorrelated and the resulting partial decisions are therefore statistically independent. Early integration techniques, on the other hand, are preferable only when a pair of modalities is highly correlated. However, coupled modalities such as audio and lip texture also contain components that are mutually independent. We therefore first perform a cross-correlation analysis on the audio and lip modalities to extract the correlated part of the information, and then employ an optimal combination of early and late integration techniques to fuse the extracted features. Experimental results evaluating the performance of the proposed system are also provided.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	NO
dc.description.sponsoredbyTubitakEu	N/A
dc.identifier.isbn	978-1-4244-0468-1
dc.identifier.issn	1520-6149
dc.identifier.scopus	2-s2.0-33947376189
dc.identifier.uri	https://hdl.handle.net/20.500.14288/7146
dc.identifier.wos	245559901036
dc.language.iso	eng
dc.publisher	IEEE
dc.relation.ispartof	2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13
dc.subject	Acoustics
dc.subject	Computer Science
dc.subject	Artificial intelligence
dc.subject	Computer science
dc.subject	Software engineering
dc.subject	Electrical electronics engineering
dc.title	Multimodal speaker identification using canonical correlation analysis
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Sargın, Mehmet Emre
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Yemez, Yücel
local.contributor.kuauthor	Tekalp, Ahmet Murat
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Department of Electrical and Electronics Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
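
The abstract describes using canonical correlation analysis (CCA) to extract the correlated part of the audio and lip features before fusion. The following is a minimal sketch of that extraction step only, not the authors' implementation. It assumes synthetic feature matrices in place of real audio and lip-texture features, and the function name `cca`, the regularization term `reg`, and the feature dimensions are illustrative choices that do not come from the paper. The early/late fusion stage is not shown.

```python
import numpy as np


def cca(X, Y, reg=1e-6):
    """Canonical correlation analysis of two feature sets (rows = frames).

    Returns projection matrices Wx, Wy and the canonical correlations,
    ordered from most to least correlated. `reg` is a small ridge term
    (an assumption of this sketch) that keeps the covariances invertible.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    # Whiten each modality with its Cholesky factor: S = L L^T.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    K = np.linalg.solve(Lx, Sxy)        # Lx^{-1} Sxy
    K = np.linalg.solve(Ly, K.T).T      # Lx^{-1} Sxy Ly^{-T}

    # Singular values of the whitened cross-covariance are the
    # canonical correlations.
    U, rho, Vt = np.linalg.svd(K, full_matrices=False)
    Wx = np.linalg.solve(Lx.T, U)       # map back to original coordinates
    Wy = np.linalg.solve(Ly.T, Vt.T)
    return Wx, Wy, rho


# Synthetic stand-ins for the two modalities (hypothetical data):
# two shared components plus modality-specific independent components.
rng = np.random.default_rng(0)
n = 2000
shared = rng.standard_normal((n, 2))
audio_raw = np.hstack([shared + 0.1 * rng.standard_normal((n, 2)),
                       rng.standard_normal((n, 4))])
lip_raw = np.hstack([shared + 0.1 * rng.standard_normal((n, 2)),
                     rng.standard_normal((n, 2))])

# Mix the columns with random orthogonal matrices so the shared part
# is not visible in any single feature dimension.
audio = audio_raw @ np.linalg.qr(rng.standard_normal((6, 6)))[0]
lip = lip_raw @ np.linalg.qr(rng.standard_normal((4, 4)))[0]

Wx, Wy, rho = cca(audio, lip)
print("canonical correlations:", np.round(rho, 3))

# Keep only the strongly correlated directions as the "correlated part".
k = int(np.sum(rho > 0.5))
audio_corr = (audio - audio.mean(axis=0)) @ Wx[:, :k]
lip_corr = (lip - lip.mean(axis=0)) @ Wy[:, :k]
```

In this sketch the projected features `audio_corr` and `lip_corr` would be candidates for early integration, while the remaining, weakly correlated components would be left to separate classifiers and late integration. The 0.5 threshold is an arbitrary choice for the synthetic data.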

Collections

Publications without Fulltext
