Publication: Multimodal speaker identification using discriminative lip motion features
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | N/A | |
dc.contributor.kuauthor | Tekalp, Ahmet Murat | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.kuauthor | Çetingül, Hasan Ertan | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Master Student | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.yokid | 26207 | |
dc.contributor.yokid | 34503 | |
dc.contributor.yokid | 107907 | |
dc.contributor.yokid | N/A | |
dc.date.accessioned | 2024-11-09T23:35:37Z | |
dc.date.issued | 2009 | |
dc.description.abstract | This chapter presents a multimodal speaker identification system that integrates audio, lip texture, and lip motion modalities. The authors propose using the "explicit" lip motion information that best represents the modality for the given problem. The work is presented in two stages. First, they consider several lip motion feature candidates, such as dense motion features on the lip region, motion features on the outer lip contour, and lip shape features, and introduce their main contribution: a novel two-stage spatial-temporal discrimination analysis framework designed to obtain the best lip motion features. For speaker identification, the best lip motion features are those that yield the highest discrimination among speakers. Next, they investigate the benefits of including the best lip motion features in multimodal recognition. Audio, lip texture, and lip motion modalities are fused by the reliability weighted summation (RWS) decision rule, and hidden Markov model (HMM)-based modeling is performed for both unimodal and multimodal recognition. Experimental results indicate that discriminative grid-based lip motion features prove more valuable and provide additional performance gains in speaker identification. © 2009, IGI Global. | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.identifier.doi | 10.4018/978-1-60566-186-5.ch016 | |
dc.identifier.isbn | 978-1-60566-186-5 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84900179389&doi=10.4018%2f978-1-60566-186-5.ch016&partnerID=40&md5=fcb17d7d71b78420819c86c412554530 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-84900179389 | |
dc.identifier.uri | http://dx.doi.org/10.4018/978-1-60566-186-5.ch016 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/12531 | |
dc.keywords | N/A | |
dc.language | English | |
dc.publisher | IGI Global | |
dc.source | Visual Speech Recognition: Lip Segmentation and Mapping | |
dc.subject | Electrical electronics engineering | |
dc.subject | Computer engineering | |
dc.title | Multimodal speaker identification using discriminative lip motion features | |
dc.type | Book Chapter | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0003-1465-8121 | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.authorid | 0000-0002-7515-3138 | |
local.contributor.authorid | N/A | |
local.contributor.kuauthor | Tekalp, Ahmet Murat | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Yemez, Yücel | |
local.contributor.kuauthor | Çetingül, Hasan Ertan | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |