Publication:
Audio-facial laughter detection in naturalistic dyadic conversations

dc.contributor.coauthorN/A
dc.contributor.departmentN/A
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorTürker, Bekir Berker
dc.contributor.kuauthorYemez, Yücel
dc.contributor.kuauthorSezgin, Tevfik Metin
dc.contributor.kuauthorErzin, Engin
dc.contributor.kuprofilePhD Student
dc.contributor.kuprofileFaculty Member
dc.contributor.kuprofileFaculty Member
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of Computer Engineering
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.yokidN/A
dc.contributor.yokid107907
dc.contributor.yokid18632
dc.contributor.yokid34503
dc.date.accessioned2024-11-10T00:12:08Z
dc.date.issued2017
dc.description.abstractWe address the problem of continuous laughter detection over audio-facial input streams obtained from naturalistic dyadic conversations. We first present meticulous annotation of laughters, cross-talks and environmental noise in an audio-facial database with explicit 3D facial mocap data. Using this annotated database, we rigorously investigate the utility of facial information, head movement and audio features for laughter detection. We identify a set of discriminative features using mutual information-based criteria, and show how they can be used with classifiers based on support vector machines (SVMs) and time delay neural networks (TDNNs). Informed by the analysis of the individual modalities, we propose a multimodal fusion setup for laughter detection using different classifier-feature combinations. We also effectively incorporate bagging into our classification pipeline to address the class imbalance problem caused by the scarcity of positive laughter instances. Our results indicate that a combination of TDNNs and SVMs lead to superior detection performance, and bagging effectively addresses data imbalance. Our experiments show that our multimodal approach supported by bagging compares favorably to the state of the art in presence of detrimental factors such as cross-talk, environmental noise, and data imbalance.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.issue4
dc.description.openaccessNO
dc.description.sponsorshipERA-Net CHIST-ERA under JOKER
dc.description.sponsorshipTurkish Scientific and Technical Research Council (TUBITAK) [113E324] This work is supported by ERA-Net CHIST-ERA under the JOKER project and Turkish Scientific and Technical Research Council (TUBITAK) under grant number 113E324.
dc.description.volume8
dc.identifier.doi10.1109/TAFFC.2017.2754256
dc.identifier.issn1949-3045
dc.identifier.scopus2-s2.0-85030642834
dc.identifier.urihttp://dx.doi.org/10.1109/TAFFC.2017.2754256
dc.identifier.urihttps://hdl.handle.net/20.500.14288/17595
dc.identifier.wos417921000011
dc.keywordsLaughter detection
dc.keywordsNaturalistic dyadic conversations
dc.keywordsFacial mocap
dc.keywordsData imbalance
dc.keywordsSpeech
dc.languageEnglish
dc.publisherIeee-Inst Electrical Electronics Engineers Inc
dc.sourceIeee Transactions On Affective Computing
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subjectComputer science
dc.subjectCybernetics
dc.titleAudio-facial laughter detection in naturalistic dyadic conversations
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.authoridN/A
local.contributor.authorid0000-0002-7515-3138
local.contributor.authorid0000-0002-1524-1646
local.contributor.authorid0000-0002-2715-2368
local.contributor.kuauthorTürker, Bekir Berker
local.contributor.kuauthorYemez, Yücel
local.contributor.kuauthorSezgin, Tevfik Metin
local.contributor.kuauthorErzin, Engin
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae

Files