Publication: Multimodal analysis of speech prosody and upper body gestures using hidden semi-Markov models
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Graduate School of Sciences and Engineering | |
dc.contributor.kuauthor | Asta, Shahriar | |
dc.contributor.kuauthor | Bozkurt, Elif | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Özkul, Serkan | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
dc.date.accessioned | 2024-11-10T00:12:18Z | |
dc.date.issued | 2013 | |
dc.description.abstract | Gesticulation is an essential component of face-to-face communication, and it contributes significantly to the natural and affective perception of human-to-human communication. In this work we investigate a new multimodal analysis framework to model relationships between intonational and gesture phrases using the hidden semi-Markov models (HSMMs). The HSMM framework effectively associates longer duration gesture phrases to shorter duration prosody clusters, while maintaining realistic gesture phrase duration statistics. We evaluate the multimodal analysis framework by generating speech prosody driven gesture animation, and employing both subjective and objective metrics. | |
dc.description.indexedby | WOS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | IEE Signal Processing Society | |
dc.identifier.doi | 10.1109/ICASSP.2013.6638339 | |
dc.identifier.isbn | 9781-4799-0356-6 | |
dc.identifier.issn | 1520-6149 | |
dc.identifier.scopus | 2-s2.0-84890457155 | |
dc.identifier.uri | https://doi.org/10.1109/ICASSP.2013.6638339 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/17636 | |
dc.identifier.wos | 329611503162 | |
dc.keywords | Prosody analysis | |
dc.keywords | Gesture segmentation | |
dc.keywords | Gesture animation | |
dc.language.iso | eng | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.ispartof | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | |
dc.subject | Acoustics | |
dc.subject | Electrical electronics engineering | |
dc.title | Multimodal analysis of speech prosody and upper body gestures using hidden semi-Markov models | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Bozkurt, Elif | |
local.contributor.kuauthor | Asta, Shahriar | |
local.contributor.kuauthor | Özkul, Serkan | |
local.contributor.kuauthor | Yemez, Yücel | |
local.contributor.kuauthor | Erzin, Engin | |
local.publication.orgunit1 | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit2 | Department of Computer Engineering | |
local.publication.orgunit2 | Graduate School of Sciences and Engineering | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication | 434c9663-2b11-4e66-9399-c863e2ebae43 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |