Publication:
Adaptive classifier cascade for multimodal speaker identification

Placeholder

Program

KU Authors

Co-Authors

Advisor

Publication Date

Language

English

Journal Title

Journal ISSN

Volume Title

Abstract

We present a multimodal open-set speaker identification system that integrates information coming from audio, face and lip motion modalities. For fusion of multiple modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is adaptively determined based on the reliability of each modality combination. A novel reliability measure, that genuinely fits to the open-set speaker identification problem, is also proposed to assess accept or reject decisions of a classifier. The proposed adaptive rule is more robust in the presence of unreliable modalities, and outperforms the hard-level max rule and soft-level weighted summation rule, provided that the employed reliability measure is effective in assessment of classifier decisions. Experimental results that support this assertion are provided.

Source:

8th International Conference on Spoken Language Processing, ICSLP 2004

Publisher:

International Speech Communication Association

Keywords:

Subject

Electrical electronics engineering, Computer engineering

Citation

Endorsement

Review

Supplemented By

Referenced By

Copyrights Note

0

Views

0

Downloads

View PlumX Details