Publication:
Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings

dc.contributor.department: Department of Computer Engineering
dc.contributor.kuauthor: Erzin, Engin
dc.contributor.schoolcollegeinstitute: College of Engineering
dc.date.accessioned: 2024-11-09T23:25:32Z
dc.date.issued: 2009
dc.description.abstract: We present a new framework for joint analysis of throat and acoustic microphone (TaM) recordings to improve throat-microphone-only speech recognition. The proposed analysis framework aims to learn joint sub-phone patterns of throat and acoustic microphone recordings through a parallel branch HMM structure. The joint sub-phone patterns define temporally correlated neighborhoods, in which a linear prediction filter estimates a spectrally rich acoustic feature vector from throat feature vectors. Multimodal speech recognition with throat and throat-driven acoustic features significantly improves throat-only speech recognition performance. Experimental evaluations on a parallel TaM database yield benchmark phoneme recognition rates of 46.81% and 60.69% for the throat-only and multimodal TaM speech recognition systems, respectively. The proposed throat-driven multimodal speech recognition system improves the phoneme recognition rate to 52.58%, a significant relative improvement with respect to the throat-only speech recognition benchmark system.
dc.description.fulltext: No
dc.description.harvestedfrom: Manual
dc.description.indexedby: WOS
dc.description.indexedby: Scopus
dc.description.openaccess: NO
dc.description.peerreviewstatus: N/A
dc.description.publisherscope: International
dc.description.readpublish: N/A
dc.description.sponsoredbyTubitakEu: N/A
dc.description.version: N/A
dc.identifier.doi: 10.1109/TaSL.2009.2016733
dc.identifier.eissn: 1558-7924
dc.identifier.embargo: N/A
dc.identifier.issn: 1558-7916
dc.identifier.quartile: Q2
dc.identifier.scopus: 2-s2.0-68549110984
dc.identifier.uri: https://doi.org/10.1109/TaSL.2009.2016733
dc.identifier.uri: https://hdl.handle.net/20.500.14288/11379
dc.identifier.wos: 268172100006
dc.keywords: Joint processing of throat and acoustic microphone (TaM) recordings
dc.keywords: Robust speech recognition
dc.keywords: Throat microphone speech recognition
dc.language.iso: eng
dc.publisher: IEEE-Inst Electrical Electronics Engineers Inc
dc.relation.affiliation: Koç University
dc.relation.collection: Koç University Institutional Repository
dc.relation.ispartof: IEEE Transactions on Audio Speech and Language Processing
dc.relation.openaccess: N/A
dc.rights: N/A
dc.subject: Acoustics
dc.subject: Electrical electronics engineering
dc.title: Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings
dc.type: Journal Article
dspace.entity.type: Publication
local.contributor.kuauthor: Erzin, Engin
relation.isOrgUnitOfPublication: 89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery: 89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication: 8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery: 8e756b23-2d4a-4ce8-b1b3-62c794a8c164