Publication: Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Turan, Mehmet Ali Tuğtekin | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 34503 | |
dc.date.accessioned | 2024-11-10T00:12:44Z | |
dc.date.issued | 2013 | |
dc.description.abstract | We investigate spectral envelope mapping problem with joint analysis of throat- and acoustic-microphone recordings to enhance throatmicrophone speech. A new phone-dependent GMM-based spectral envelope mapping scheme, which performs the minimum mean square error (MMSE) estimation of the acoustic-microphone spectral envelope, has been proposed. Experimental evaluations are performed to compare the proposed mapping scheme to the state-of-theart GMM-based estimator using both objective and subjective evaluations. Objective evaluations are performed with the log-spectral distortion (LSD) and the wideband perceptual evaluation of speech quality (PESQ) metrics. Subjective evaluations are performed with the A/B pair comparison listening test. Both objective and subjective evaluations yield that the proposed phone-dependent mapping consistently improves performances over the state-of-the-art GMM estimator. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsorship | IEE Signal Processing Society | |
dc.identifier.doi | 10.1109/ICASSP.2013.6639029 | |
dc.identifier.isbn | 9781-4799-0356-6 | |
dc.identifier.issn | 1520-6149 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84890457569&doi=10.1109%2fICASSP.2013.6639029&partnerID=40&md5=ac71746b68d6b7b1ca203f551dff8948 | |
dc.identifier.scopus | 2-s2.0-84890457569 | |
dc.identifier.uri | http://dx.doi.org/10.1109/ICASSP.2013.6639029 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/17706 | |
dc.identifier.wos | 329611507042 | |
dc.keywords | Throat-microphone | |
dc.keywords | Speech enhancement | |
dc.keywords | Spectral envelope estimation | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.source | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | |
dc.subject | Acoustics | |
dc.subject | Electrical electronic engineering | |
dc.title | Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-3822-235X | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.kuauthor | Turan, Mehmet Ali Tuğtekin | |
local.contributor.kuauthor | Erzin, Engin | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |