Publication:
A new statistical excitation mapping for enhancement of throat microphone recordings

dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorTuran, Mehmet Ali Tuğtekin
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuprofilePhD Student
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokidN/A
dc.contributor.yokid34503
dc.date.accessioned2024-11-10T00:10:29Z
dc.date.issued2013
dc.description.abstractIn this paper, we investigate a new statistical excitation mapping technique to enhance throat-microphone speech using joint analysis of throat- and acoustic-microphone recordings. In a recent study, we employed source-filter decomposition to enhance the spectral envelope of throat-microphone recordings. Within the source-filter decomposition framework, we observed that the spectral envelope difference between the excitation signals of throat- and acoustic-microphone recordings is an important source of degradation in throat-microphone voice quality. In this study, we model the spectral envelope difference of the excitation signals as a spectral tilt vector, and we propose a new phone-dependent GMM-based spectral tilt mapping scheme to enhance the throat excitation signal. Experiments are performed to evaluate the proposed excitation mapping scheme against state-of-the-art throat-microphone speech enhancement techniques using both objective and subjective evaluations. Objective evaluations are performed with the wideband perceptual evaluation of speech quality (ITU-PESQ) metric. Subjective evaluations are performed with an A/B pair comparison listening test. Both objective and subjective evaluations show that the proposed statistical excitation mapping consistently delivers higher improvements than statistical mapping of the spectral envelope for enhancing throat-microphone recordings.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsorshipAmazon
dc.description.sponsorshipet al.
dc.description.sponsorshipEuropean Language Resources Association (ELRA)
dc.description.sponsorshipGoogle
dc.description.sponsorshipMicrosoft
dc.description.sponsorshipSytral
dc.identifier.doiN/A
dc.identifier.issn2308-457X
dc.identifier.linkhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84906276857&partnerID=40&md5=9973c3b7825b78e1b2863191e5fe31e6
dc.identifier.scopus2-s2.0-84906276857
dc.identifier.uriN/A
dc.identifier.urihttps://hdl.handle.net/20.500.14288/17314
dc.identifier.wos395050001193
dc.keywordsExcitation mapping
dc.keywordsGMM mapping
dc.keywordsSpectral envelope mapping
dc.keywordsSpeech enhancement
dc.keywordsThroat-microphone
dc.keywordsMicrophones
dc.keywordsPhotomapping
dc.keywordsQuality control
dc.keywordsMapping techniques
dc.keywordsObjective and subjective evaluations
dc.keywordsObjective evaluation
dc.keywordsPerceptual evaluation of speech qualities
dc.keywordsSpectral envelopes
dc.keywordsSubjective evaluations
dc.keywordsThroat microphones
dc.keywordsThroat-microphone
dc.keywordsAudio recordings
dc.languageEnglish
dc.publisherInternational Speech Communication Association
dc.sourceProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
dc.subjectComputer Science
dc.subjectArtificial intelligence
dc.subjectElectrical electronics engineering
dc.titleA new statistical excitation mapping for enhancement of throat microphone recordings
dc.typeConference proceeding
dspace.entity.typePublication
local.contributor.authorid0000-0002-3822-235X
local.contributor.authorid0000-0002-2715-2368
local.contributor.kuauthorTuran, Mehmet Ali Tuğtekin
local.contributor.kuauthorErzin, Engin
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
