Publication: A new statistical excitation mapping for enhancement of throat microphone recordings
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Turan, Mehmet Ali Tuğtekin | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 34503 | |
dc.date.accessioned | 2024-11-10T00:10:29Z | |
dc.date.issued | 2013 | |
dc.description.abstract | In this paper we investigate a new statistical excitation mapping technique to enhance throat-microphone speech using joint analysis of throat- and acoustic-microphone recordings. In a recent study we employed source-filter decomposition to enhance the spectral envelope of the throat-microphone recordings. In the source-filter decomposition framework we observed that the spectral envelope difference of the excitation signals of throat- and acoustic-microphone recordings is an important source of the degradation in the throat-microphone voice quality. In this study we model the spectral envelope difference of the excitation signals as a spectral tilt vector, and we propose a new phone-dependent GMM-based spectral tilt mapping scheme to enhance the throat excitation signal. Experiments are performed to evaluate the proposed excitation mapping scheme in comparison with the state-of-the-art throat-microphone speech enhancement techniques using both objective and subjective evaluations. Objective evaluations are performed with the wideband perceptual evaluation of speech quality (ITU-PESQ) metric. Subjective evaluations are performed with the A/B pair comparison listening test. Both objective and subjective evaluations show that the proposed statistical excitation mapping consistently delivers higher improvements than the statistical mapping of the spectral envelope to enhance the throat-microphone recordings. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsorship | Amazon | |
dc.description.sponsorship | et al. | |
dc.description.sponsorship | European Language Resources Association (ELRA) | |
dc.description.sponsorship | Microsoft | |
dc.description.sponsorship | Sytral | |
dc.identifier.doi | N/A | |
dc.identifier.issn | 2308-457X | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84906276857&partnerID=40&md5=9973c3b7825b78e1b2863191e5fe31e6 | |
dc.identifier.scopus | 2-s2.0-84906276857 | |
dc.identifier.uri | N/A | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/17314 | |
dc.identifier.wos | 395050001193 | |
dc.keywords | Excitation mapping | |
dc.keywords | GMM mapping | |
dc.keywords | Spectral envelope mapping | |
dc.keywords | Speech enhancement | |
dc.keywords | Throat-microphone | |
dc.keywords | Microphones | |
dc.keywords | Photomapping | |
dc.keywords | Quality control | |
dc.keywords | Mapping techniques | |
dc.keywords | Objective and subjective evaluations | |
dc.keywords | Objective evaluation | |
dc.keywords | Perceptual evaluation of speech qualities | |
dc.keywords | Spectral envelopes | |
dc.keywords | Subjective evaluations | |
dc.keywords | Throat microphones | |
dc.keywords | Throat-microphone | |
dc.keywords | Audio recordings | |
dc.language | English | |
dc.publisher | International Speech Communication Association | |
dc.source | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | |
dc.subject | Computer Science | |
dc.subject | Artificial intelligence | |
dc.subject | Electrical electronics engineering | |
dc.title | A new statistical excitation mapping for enhancement of throat microphone recordings | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-3822-235X | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.kuauthor | Turan, Mehmet Ali Tuğtekin | |
local.contributor.kuauthor | Erzin, Engin | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |