Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

Publication:
Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

Departments

Organizational Unit

Department of Computer Engineering

Organizational Unit

Graduate School of Sciences and Engineering

School / College / Institute

Organizational Unit

College of Engineering

Organizational Unit

GRADUATE SCHOOL OF SCIENCES AND ENGINEERING

Upper Org Unit

KU-Authors

Faculty Member, Erzin, Engin

Faculty Member, Sezgin, Tevfik Metin

PhD Student, Türker, Bekir Berker

Faculty Member, Yemez, Yücel

Publication Date

2018

Type

Conference Proceeding

Abstract

Head-nods and turn-taking both significantly contribute conversational dynamics in dyadic interactions. Timely prediction and use of these events is quite valuable for dialog management systems in human-robot interaction. in this study, we present an audio-visual prediction framework for the head-nod and turn taking events that can also be utilized in real-time systems. Prediction systems based on Support vector Machines (SVM) and Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are trained on human-human conversational data. Unimodal and multi-modal classification performances of head-nod and turn-taking events are reported over the IEMOCaP dataset.

Publisher

Isca-int Speech Communication assoc

Subject

Computer Science, Artificial intelligence, Electrical electronics engineering

Source

19th Annual Conference of the international Speech Communication Association (interspeech 2018), Vols 1-6: Speech Research for Emerging Markets in Multilingual Societies

DOI

10.21437/interspeech.2018-2215

URI

https://doi.org/10.21437/interspeech.2018-2215
https://hdl.handle.net/20.500.14288/14715

Publication:
Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

0

Views

0

Downloads

Publication: Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

0

Views

0

Downloads

Publication:
Audio-visual prediction of head-nod and turn-taking events in dyadic interactions