Publication:
Multimodal speaker identification using an adaptive classifier cascade based on modality reliability

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuauthorYemez, Yücel
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-11-09T22:50:17Z
dc.date.issued2005
dc.description.abstractWe present a multimodal open-set speaker identification system that integrates information coming from audio, face and lip motion modalities. For fusion of multiple modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is adaptively determined based on the reliability of each modality combination. A novel reliability measure, that genuinely fits to the open-set speaker identification problem, is also proposed to assess accept or reject decisions of a classifier. A formal framework is developed based on probability of correct decision for analytical comparison of the proposed adaptive rule with other classifier combination rules. The proposed adaptive rule is more robust in the presence of unreliable modalities, and outperforms the hard-level max rule and soft-level weighted summation rule, provided that the employed reliability measure is effective in assessment of classifier decisions. Experimental results that support this assertion are provided.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.issue5
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.volume7
dc.identifier.doi10.1109/TMM.2005.854464
dc.identifier.eissn1941-0077
dc.identifier.issn1520-9210
dc.identifier.quartileQ1
dc.identifier.scopus2-s2.0-26844533276
dc.identifier.urihttps://doi.org/10.1109/TMM.2005.854464
dc.identifier.urihttps://hdl.handle.net/20.500.14288/6649
dc.identifier.wos232084900005
dc.keywordsClassifier combining
dc.keywordsModality reliability
dc.keywordsMultimodal speaker identification fusion
dc.keywordsFace
dc.keywordsCombination
dc.keywordsInformation
dc.keywordsSpeech
dc.keywordsVerification
dc.keywordsRecognition
dc.language.isoeng
dc.publisherIEEE-Inst Electrical Electronics Engineers Inc
dc.relation.ispartofIEEE Transactions on Multimedia
dc.subjectComputer science
dc.subjectInformation systems
dc.subjectEngineering
dc.subjectSoftware engineering
dc.subjectTelecommunications
dc.titleMultimodal speaker identification using an adaptive classifier cascade based on modality reliability
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.kuauthorErzin, Engin
local.contributor.kuauthorYemez, Yücel
local.contributor.kuauthorTekalp, Ahmet Murat
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Computer Engineering
local.publication.orgunit2Department of Electrical and Electronics Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files