Publication:
Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings

dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokid34503
dc.date.accessioned2024-11-09T23:25:32Z
dc.date.issued2009
dc.description.abstractWe present a new framework for joint analysis of throat and acoustic microphone (TaM) recordings to improve throat microphone only speech recognition. the proposed analysis framework aims to learn joint sub-phone patterns of throat and acoustic microphone recordings through a parallel branch HMM structure. the joint sub-phone patterns define temporally correlated neighborhoods, in which a linear prediction filter estimates a spectrally rich acoustic feature vector from throat feature vectors. Multimodal speech recognition with throat and throat-driven acoustic features significantly improves throat-only speech recognition performance. Experimental evaluations on a parallel TaM database yield benchmark phoneme recognition rates for throat-only and multimodal TaM speech recognition systems as 46.81% and 60.69%, respectively. the proposed throat-driven multimodal speech recognition system improves phoneme recognition rate to 52.58%, A significant relative improvement with respect to the throat-only speech recognition benchmark system.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.issue7
dc.description.openaccessNO
dc.description.publisherscopeInternational
dc.description.volume17
dc.identifier.doi10.1109/TaSL.2009.2016733
dc.identifier.eissn1558-7924
dc.identifier.issn1558-7916
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-68549110984
dc.identifier.urihttp://dx.doi.org/10.1109/TaSL.2009.2016733
dc.identifier.urihttps://hdl.handle.net/20.500.14288/11379
dc.identifier.wos268172100006
dc.keywordsJoint processing of throat and acoustic microphone (TaM) recordings
dc.keywordsRobust speech recognition
dc.keywordsThroat microphone speech recognition
dc.languageEnglish
dc.publisherIEEE-inst Electrical Electronics Engineers inc
dc.sourceIEEE Transactions on Audio Speech and Language Processing
dc.subjectAcoustics
dc.subjectElectrical electronics engineering
dc.titleImproving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.authorid0000-0002-2715-2368
local.contributor.kuauthorErzin, Engin
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files