Publication: Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | 34503 | |
dc.date.accessioned | 2024-11-09T23:25:32Z | |
dc.date.issued | 2009 | |
dc.description.abstract | We present a new framework for joint analysis of throat and acoustic microphone (TaM) recordings to improve throat microphone only speech recognition. the proposed analysis framework aims to learn joint sub-phone patterns of throat and acoustic microphone recordings through a parallel branch HMM structure. the joint sub-phone patterns define temporally correlated neighborhoods, in which a linear prediction filter estimates a spectrally rich acoustic feature vector from throat feature vectors. Multimodal speech recognition with throat and throat-driven acoustic features significantly improves throat-only speech recognition performance. Experimental evaluations on a parallel TaM database yield benchmark phoneme recognition rates for throat-only and multimodal TaM speech recognition systems as 46.81% and 60.69%, respectively. the proposed throat-driven multimodal speech recognition system improves phoneme recognition rate to 52.58%, A significant relative improvement with respect to the throat-only speech recognition benchmark system. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.issue | 7 | |
dc.description.openaccess | NO | |
dc.description.publisherscope | International | |
dc.description.volume | 17 | |
dc.identifier.doi | 10.1109/TaSL.2009.2016733 | |
dc.identifier.eissn | 1558-7924 | |
dc.identifier.issn | 1558-7916 | |
dc.identifier.quartile | Q2 | |
dc.identifier.scopus | 2-s2.0-68549110984 | |
dc.identifier.uri | http://dx.doi.org/10.1109/TaSL.2009.2016733 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/11379 | |
dc.identifier.wos | 268172100006 | |
dc.keywords | Joint processing of throat and acoustic microphone (TaM) recordings | |
dc.keywords | Robust speech recognition | |
dc.keywords | Throat microphone speech recognition | |
dc.language | English | |
dc.publisher | IEEE-inst Electrical Electronics Engineers inc | |
dc.source | IEEE Transactions on Audio Speech and Language Processing | |
dc.subject | Acoustics | |
dc.subject | Electrical electronics engineering | |
dc.title | Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings | |
dc.type | Journal Article | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.kuauthor | Erzin, Engin | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |