Publication:
Audio-visual prediction of head-nod and turn-taking events in dyadic interactions

dc.conference.dateAUG 02-SEP 06, 2018
dc.conference.locationHyderabad, India
dc.conference.organizer19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018)
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.facultymemberYes
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuauthorSezgin, Tevfik Metin
dc.contributor.kuauthorTürker, Bekir Berker
dc.contributor.kuauthorYemez, Yücel
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2024-11-09T23:51:28Z
dc.date.issued2018
dc.description.abstractHead-nods and turn-taking both contribute significantly to conversational dynamics in dyadic interactions. Timely prediction and use of these events is valuable for dialog management systems in human-robot interaction. In this study, we present an audio-visual prediction framework for head-nod and turn-taking events that can also be utilized in real-time systems. Prediction systems based on Support Vector Machines (SVM) and Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are trained on human-human conversational data. Unimodal and multi-modal classification performances of head-nod and turn-taking events are reported over the IEMOCAP dataset.
dc.description.fulltextYes
dc.description.harvestedfromManual
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessGreen OA
dc.description.peerreviewstatusN/A
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipTurkish Scientific and Technical Research Council (TUBITAK) [113E324, 217E040] This work is supported by the Turkish Scientific and Technical Research Council (TUBITAK) under grant numbers 113E324 and 217E040.
dc.description.studentonlypublicationNo
dc.description.studentpublicationYes
dc.description.versionPost-print
dc.identifier.doi10.21437/interspeech.2018-2215
dc.identifier.embargoN/A
dc.identifier.endpage1745
dc.identifier.filenameinventorynoIR06879
dc.identifier.isbn9781510872219
dc.identifier.issn2308-457X
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85054959957
dc.identifier.startpage1741
dc.identifier.urihttps://doi.org/10.21437/interspeech.2018-2215
dc.identifier.urihttps://hdl.handle.net/20.500.14288/14715
dc.identifier.wos000465363900364
dc.keywordsHead-nod
dc.keywordsTurn-taking
dc.keywordsSocial signals
dc.keywordsEvent prediction
dc.keywordsDyadic conversations
dc.keywordsHuman-robot interaction
dc.language.isoeng
dc.publisherInternational Speech Communication Association (ISCA)
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartof19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Vols 1-6: Speech Research for Emerging Markets in Multilingual Societies
dc.relation.openaccessYes
dc.rightsOther
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subjectElectrical electronics engineering
dc.titleAudio-visual prediction of head-nod and turn-taking events in dyadic interactions
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorTürker, Bekir Berker
local.contributor.kuauthorErzin, Engin
local.contributor.kuauthorYemez, Yücel
local.contributor.kuauthorSezgin, Tevfik Metin
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Name:
IR06879.pdf
Size:
166.66 KB
Format:
Adobe Portable Document Format