Publication:
Adaptive classifier cascade for multimodal speaker identification

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuauthorTekalp, Ahmet Murat
dc.contributor.kuauthorYemez, Yücel
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-11-09T23:49:36Z
dc.date.issued2004
dc.description.abstractWe present a multimodal open-set speaker identification system that integrates information coming from audio, face and lip motion modalities. For fusion of multiple modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is adaptively determined based on the reliability of each modality combination. A novel reliability measure, that genuinely fits to the open-set speaker identification problem, is also proposed to assess accept or reject decisions of a classifier. The proposed adaptive rule is more robust in the presence of unreliable modalities, and outperforms the hard-level max rule and soft-level weighted summation rule, provided that the employed reliability measure is effective in assessment of classifier decisions. Experimental results that support this assertion are provided.
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipInternational Speech Communication Association (ISCA)
dc.identifier.linkhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85009151471andpartnerID=40andmd5=8b08bf74073ee2e66bd3d319d28f506b
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85009151471
dc.identifier.urihttps://hdl.handle.net/20.500.14288/14394
dc.keywordsClassification (of information)
dc.keywordsLoudspeakers
dc.keywordsReliability
dc.keywordsAdaptive classifiers
dc.keywordsCascade of classifiers
dc.keywordsClassifier decisions
dc.keywordsMulti-modal speaker identification
dc.keywordsOpen-set speaker identification system
dc.keywordsReliability measure
dc.keywordsSpeaker identification
dc.keywordsWeighted summations
dc.keywordsSpeech recognition
dc.language.isoeng
dc.publisherInternational Speech Communication Association
dc.relation.ispartof8th International Conference on Spoken Language Processing, ICSLP 2004
dc.subjectElectrical electronics engineering
dc.subjectComputer engineering
dc.titleAdaptive classifier cascade for multimodal speaker identification
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorTekalp, Ahmet Murat
local.contributor.kuauthorErzin, Engin
local.contributor.kuauthorYemez, Yücel
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2Department of Computer Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files