Publication: Adaptive classifier cascade for multimodal speaker identification
Language
English
Abstract
We present a multimodal open-set speaker identification system that integrates information from the audio, face, and lip motion modalities. To fuse the modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is determined adaptively, based on the reliability of each modality combination. We also propose a novel reliability measure, well suited to the open-set speaker identification problem, to assess the accept or reject decisions of a classifier. The proposed adaptive rule is more robust in the presence of unreliable modalities and outperforms the hard-level max rule and the soft-level weighted summation rule, provided that the employed reliability measure is effective in assessing classifier decisions. Experimental results supporting this claim are provided.
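As a rough illustration of the idea in the abstract, the following Python sketch shows one possible reading of a reliability-ordered cascade for open-set identification. It is not the authors' rule: the abstract does not give the stopping criterion or the reliability measure, so the `StageOutput` fields, the two thresholds, the "defer to the next stage when undecided" behavior, and the midpoint fallback at the last stage are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class StageOutput:
    """Output of one classifier (one modality combination) for a test sample.

    All fields are hypothetical; the abstract does not define them.
    """
    name: str           # illustrative label, e.g. "audio+lip"
    candidate: str      # best-matching enrolled speaker
    score: float        # accept/reject score; higher favors accept
    reliability: float  # reliability estimate; higher means more trusted


def adaptive_cascade(
    outputs: Sequence[StageOutput],
    accept_thr: float = 1.0,
    reject_thr: float = -1.0,
) -> Tuple[Optional[str], str]:
    """Return (identity or None for reject, name of the deciding stage).

    Stages are visited from most to least reliable. A stage decides only
    when its score is clearly above accept_thr or below reject_thr;
    otherwise the sample is passed on. The last stage must decide, using
    the midpoint of the two thresholds (an assumption of this sketch).
    """
    if not outputs:
        raise ValueError("at least one classifier output is required")

    # Adaptive ordering: sort per sample by the reliability estimates.
    ordered = sorted(outputs, key=lambda o: o.reliability, reverse=True)

    for stage in ordered[:-1]:
        if stage.score >= accept_thr:
            return stage.candidate, stage.name   # confident accept
        if stage.score <= reject_thr:
            return None, stage.name              # confident reject
        # Undecided: defer to the next, less reliable stage.

    last = ordered[-1]
    midpoint = (accept_thr + reject_thr) / 2.0
    identity = last.candidate if last.score >= midpoint else None
    return identity, last.name


if __name__ == "__main__":
    sample = [
        StageOutput("audio", "alice", 0.2, 0.9),
        StageOutput("face", "bob", 1.5, 0.6),
        StageOutput("lip", "carol", -2.0, 0.3),
    ]
    print(adaptive_cascade(sample))
```

In this toy input the most reliable stage ("audio") is undecided, so the decision falls to the next stage ("face"), whose score exceeds the accept threshold. In contrast to the max rule or weighted summation mentioned in the abstract, a less reliable stage is consulted here only when the more reliable ones cannot decide.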
Source:
8th International Conference on Spoken Language Processing, ICSLP 2004
Publisher:
International Speech Communication Association
Subject
Electrical electronics engineering, Computer engineering