Publication:
Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

Embargo Status

N/A

Abstract

Head-nods and turn-taking both contribute significantly to conversational dynamics in dyadic interactions. Timely prediction and use of these events is valuable for dialog management systems in human-robot interaction. In this study, we present an audio-visual prediction framework for head-nod and turn-taking events that can also be utilized in real-time systems. Prediction systems based on Support Vector Machines (SVM) and Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are trained on human-human conversational data. Unimodal and multimodal classification performances for head-nod and turn-taking events are reported on the IEMOCAP dataset.

Publisher

International Speech Communication Association (ISCA)

Subject

Computer science, Artificial intelligence, Electrical electronics engineering

Source

19th Annual Conference of the International Speech Communication Association (Interspeech 2018), Vols 1-6: Speech Research for Emerging Markets in Multilingual Societies

DOI

10.21437/interspeech.2018-2215
