Publication: Automatic vocal tract landmark tracking in rtMRI using fully convolutional networks and Kalman filter
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Asadiabadi, Sasan | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.yokid | 34503 | |
dc.contributor.yokid | N/A | |
dc.date.accessioned | 2024-11-09T11:51:23Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Vocal tract (VT) contour detection in real time MRI is a pre-stage to many speech production related applications such as articulatory analysis and synthesis. In this work, we present an algorithm for robust detection of keypoints on the vocal tract in rtMRI sequences using fully convolutional networks (FCN) via a heatmap regression approach. We as well introduce a spatio-temporal stabilization scheme based on a combination of Principal Component Analysis (PCA) and Kalman filter (KF) to extract stable landmarks in space and time. The proposed VT landmark detection algorithm generalizes well across subjects and demonstrates significant improvement over the state of the art baselines, in terms of spatial and temporal errors. | |
dc.description.fulltext | YES | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | N/A | |
dc.description.version | Author's final manuscript | |
dc.format | ||
dc.identifier.doi | 10.1109/ICASSP40776.2020.9054332 | |
dc.identifier.eissn | 2379-190X | |
dc.identifier.embargo | NO | |
dc.identifier.filenameinventoryno | IR02739 | |
dc.identifier.isbn | 9781509066315 | |
dc.identifier.issn | 1520-6149 | |
dc.identifier.link | https://doi.org/10.1109/ICASSP40776.2020.9054332 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85089213512 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/709 | |
dc.keywords | Vocal tract dynamics | |
dc.keywords | Fully convolutional networks | |
dc.keywords | Heatmap regression | |
dc.keywords | Kalman filter | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.grantno | NA | |
dc.relation.uri | http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/9418 | |
dc.source | 2020 IEEE International Conference On Acoustics, Speech And Signal Processing (ICASSP) | |
dc.subject | Heating systems | |
dc.subject | Convolution | |
dc.subject | Signal processing algorithms | |
dc.title | Automatic vocal tract landmark tracking in rtMRI using fully convolutional networks and Kalman filter | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.authorid | N/A | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Asadiabadi, Sasan | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |
Files
Original bundle
1 - 1 of 1