Vocal tract contour tracking in rtMRI using deep temporal regression network

dc.contributor.department	Department of Electrical and Electronics Engineering
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Asadiabadi, Sasan
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	Graduate School of Sciences and Engineering
dc.date.accessioned	2024-11-09T12:11:35Z
dc.date.issued	2020
dc.description.abstract	Recent advances in real-time Magnetic Resonance Imaging (rtMRI) provide an invaluable tool to study speech articulation. In this paper, we present an effective deep learning approach for supervised detection and tracking of vocal tract contours in a sequence of rtMRI frames. We train a single-input, multiple-output deep temporal regression network (DTRN) to detect the vocal tract (VT) contour and the separation boundary between different articulators. The DTRN learns the non-linear mapping from an overlapping fixed-length sequence of rtMRI frames to the corresponding articulatory movements, where a blend of the overlapping contour estimates defines the detected VT contour. The detected contour is refined at a post-processing stage using an appearance model to further improve the accuracy of VT contour detection. The proposed VT contour tracking model is trained and evaluated over the USC-TIMIT dataset. Performance evaluation is carried out using three objective assessment metrics, covering separating landmark detection, contour tracking, and temporal stability of the contour landmarks, in comparison with three baseline approaches from the recent literature. Results indicate significant improvements with the proposed method over the state-of-the-art baselines.
dc.description.fulltext	YES
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	N/A
dc.description.sponsorship	N/A
dc.description.version	Author's final manuscript
dc.description.volume	28
dc.identifier.doi	10.1109/TASLP.2020.3036182
dc.identifier.eissn	2329-9304
dc.identifier.embargo	NO
dc.identifier.filenameinventoryno	IR02614
dc.identifier.issn	2329-9290
dc.identifier.quartile	N/A
dc.identifier.scopus	2-s2.0-85096832888
dc.identifier.uri	https://hdl.handle.net/20.500.14288/1077
dc.identifier.wos	595525300004
dc.keywords	Estimation
dc.keywords	Magnetic resonance imaging
dc.keywords	Speech processing
dc.keywords	Image segmentation
dc.keywords	Training
dc.keywords	Heating systems
dc.keywords	Tracking
dc.keywords	Appearance model
dc.keywords	Contour detection
dc.keywords	Deep neural network
dc.keywords	Real-time magnetic resonance imaging (rtMRI)
dc.keywords	Speech production
dc.keywords	Vocal tract
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.grantno	N/A
dc.relation.ispartof	IEEE/ACM Transactions on Audio, Speech, and Language Processing
dc.relation.uri	http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/9253
dc.subject	Acoustics
dc.subject	Engineering
dc.title	Vocal tract contour tracking in rtMRI using deep temporal regression network
dc.type	Journal Article
dspace.entity.type	Publication
local.contributor.kuauthor	Asadiabadi, Sasan
local.contributor.kuauthor	Erzin, Engin
local.publication.orgunit1	Graduate School of Sciences and Engineering
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Electrical and Electronics Engineering
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
person.familyName	Asadiabadi
person.familyName	Erzin
person.givenName	Sasan
person.givenName	Engin
relation.isOrgUnitOfPublication	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Name: 9253.pdf
Size: 1.01 MB
Format: Adobe Portable Document Format

Collections

Publications with Fulltext