Publication: Adaptive classifier cascade for multimodal speaker identification
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-11-09T23:49:36Z | |
dc.date.issued | 2004 | |
dc.description.abstract | We present a multimodal open-set speaker identification system that integrates information from the audio, face, and lip motion modalities. To fuse the modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is determined adaptively from the reliability of each modality combination. We also propose a novel reliability measure, suited to the open-set speaker identification problem, for assessing the accept or reject decisions of a classifier. The proposed adaptive rule is more robust in the presence of unreliable modalities and outperforms the hard-level max rule and the soft-level weighted summation rule, provided that the employed reliability measure is effective in assessing classifier decisions. Experimental results that support this assertion are provided. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | International Speech Communication Association (ISCA) | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85009151471&partnerID=40&md5=8b08bf74073ee2e66bd3d319d28f506b | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85009151471 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/14394 | |
dc.keywords | Classification (of information) | |
dc.keywords | Loudspeakers | |
dc.keywords | Reliability | |
dc.keywords | Adaptive classifiers | |
dc.keywords | Cascade of classifiers | |
dc.keywords | Classifier decisions | |
dc.keywords | Multi-modal speaker identification | |
dc.keywords | Open-set speaker identification system | |
dc.keywords | Reliability measure | |
dc.keywords | Speaker identification | |
dc.keywords | Weighted summations | |
dc.keywords | Speech recognition | |
dc.language.iso | eng | |
dc.publisher | International Speech Communication Association | |
dc.relation.ispartof | 8th International Conference on Spoken Language Processing, ICSLP 2004 | |
dc.subject | Electrical electronics engineering | |
dc.subject | Computer engineering | |
dc.title | Adaptive classifier cascade for multimodal speaker identification | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Yemez, Yücel | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit2 | Department of Electrical and Electronics Engineering | |
local.publication.orgunit2 | Department of Computer Engineering | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |
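
Illustrative note: the abstract describes a cascade whose classifier order follows the reliability of each modality combination. The sketch below is only one possible reading of that idea and is not the authors' published rule. The names `StageOutput` and `adaptive_cascade`, the separate `margin` field, and the threshold value are all hypothetical, introduced here for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class StageOutput:
    name: str                # modality combination, e.g. "audio+lip" (hypothetical label)
    speaker: Optional[str]   # accepted speaker id, or None for a reject decision
    reliability: float       # assumed reliability of this modality combination
    margin: float            # assumed confidence in this stage's own decision


def adaptive_cascade(
    outputs: Sequence[StageOutput], threshold: float = 0.5
) -> Tuple[Optional[str], Optional[str]]:
    """Return (decision, stage name) from the first confident stage.

    Stages are visited from most to least reliable. A stage's decision
    (accept a speaker, or reject) is taken only if its margin reaches the
    threshold; otherwise the cascade falls through to the next stage.
    If no stage is confident, the input is rejected: (None, None).
    """
    ordered = sorted(outputs, key=lambda o: o.reliability, reverse=True)
    for stage in ordered:
        if stage.margin >= threshold:
            return stage.speaker, stage.name
    return None, None


if __name__ == "__main__":
    stages = [
        StageOutput("audio", "alice", reliability=0.4, margin=0.9),
        StageOutput("audio+lip", "bob", reliability=0.9, margin=0.1),
        StageOutput("face", None, reliability=0.7, margin=0.8),
    ]
    print(adaptive_cascade(stages))
```

In this sketch the most reliable stage ("audio+lip") is not confident in its decision, so the cascade falls through to "face", whose reject decision is taken. A hard-level max rule or a weighted summation, as named in the abstract, would combine the same outputs differently.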