Multimodal speaker identification using an adaptive classifier cascade based on modality reliability

Publication:
Multimodal speaker identification using an adaptive classifier cascade based on modality reliability

Departments

Organizational Unit

Department of Electrical and Electronics Engineering

Organizational Unit

Department of Computer Engineering

School / College / Institute

Organizational Unit

College of Engineering

KU-Authors

Erzin, Engin

Tekalp, Ahmet Murat

Yemez, Yücel

Publication Date

2005

Type

Journal Article

Abstract

We present a multimodal open-set speaker identification system that integrates information coming from audio, face and lip motion modalities. For fusion of multiple modalities, we propose a new adaptive cascade rule that favors reliable modality combinations through a cascade of classifiers. The order of the classifiers in the cascade is adaptively determined based on the reliability of each modality combination. A novel reliability measure, that genuinely fits to the open-set speaker identification problem, is also proposed to assess accept or reject decisions of a classifier. A formal framework is developed based on probability of correct decision for analytical comparison of the proposed adaptive rule with other classifier combination rules. The proposed adaptive rule is more robust in the presence of unreliable modalities, and outperforms the hard-level max rule and soft-level weighted summation rule, provided that the employed reliability measure is effective in assessment of classifier decisions. Experimental results that support this assertion are provided.

Publisher

IEEE-Inst Electrical Electronics Engineers Inc

Subject

Computer science, Information systems, Engineering, Software engineering, Telecommunications

Source

IEEE Transactions on Multimedia

DOI

10.1109/TMM.2005.854464

URI

https://doi.org/10.1109/TMM.2005.854464
https://hdl.handle.net/20.500.14288/6649

Publication:
Multimodal speaker identification using an adaptive classifier cascade based on modality reliability

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

0

Views

0

Downloads

Publication: Multimodal speaker identification using an adaptive classifier cascade based on modality reliability

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

0

Views

0

Downloads

Publication:
Multimodal speaker identification using an adaptive classifier cascade based on modality reliability