Audio-facial laughter detection in naturalistic dyadic conversations

Publication:
Audio-facial laughter detection in naturalistic dyadic conversations

dc.contributor.coauthor	N/A
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Sezgin, Tevfik Metin
dc.contributor.kuauthor	Türker, Bekir Berker
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-10T00:12:08Z
dc.date.issued	2017
dc.description.abstract	We address the problem of continuous laughter detection over audio-facial input streams obtained from naturalistic dyadic conversations. We first present meticulous annotation of laughters, cross-talks and environmental noise in an audio-facial database with explicit 3D facial mocap data. Using this annotated database, we rigorously investigate the utility of facial information, head movement and audio features for laughter detection. We identify a set of discriminative features using mutual information-based criteria, and show how they can be used with classifiers based on support vector machines (SVMs) and time delay neural networks (TDNNs). Informed by the analysis of the individual modalities, we propose a multimodal fusion setup for laughter detection using different classifier-feature combinations. We also effectively incorporate bagging into our classification pipeline to address the class imbalance problem caused by the scarcity of positive laughter instances. Our results indicate that a combination of TDNNs and SVMs lead to superior detection performance, and bagging effectively addresses data imbalance. Our experiments show that our multimodal approach supported by bagging compares favorably to the state of the art in presence of detrimental factors such as cross-talk, environmental noise, and data imbalance.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.issue	4
dc.description.openaccess	NO
dc.description.sponsoredbyTubitakEu	N/A
dc.description.sponsorship	ERA-Net CHIST-ERA under JOKER
dc.description.sponsorship	Turkish Scientific and Technical Research Council (TUBITAK) [113E324] This work is supported by ERA-Net CHIST-ERA under the JOKER project and Turkish Scientific and Technical Research Council (TUBITAK) under grant number 113E324.
dc.description.volume	8
dc.identifier.doi	10.1109/TAFFC.2017.2754256
dc.identifier.issn	1949-3045
dc.identifier.scopus	2-s2.0-85030642834
dc.identifier.uri	https://doi.org/10.1109/TAFFC.2017.2754256
dc.identifier.uri	https://hdl.handle.net/20.500.14288/17595
dc.identifier.wos	417921000011
dc.keywords	Laughter detection
dc.keywords	Naturalistic dyadic conversations
dc.keywords	Facial mocap
dc.keywords	Data imbalance
dc.keywords	Speech
dc.language.iso	eng
dc.publisher	Ieee-Inst Electrical Electronics Engineers Inc
dc.relation.ispartof	Ieee Transactions On Affective Computing
dc.subject	Computer science
dc.subject	Artificial intelligence
dc.subject	Computer science
dc.subject	Cybernetics
dc.title	Audio-facial laughter detection in naturalistic dyadic conversations
dc.type	Journal Article
dspace.entity.type	Publication
local.contributor.kuauthor	Türker, Bekir Berker
local.contributor.kuauthor	Yemez, Yücel
local.contributor.kuauthor	Sezgin, Tevfik Metin
local.contributor.kuauthor	Erzin, Engin
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext

Publication: Audio-facial laughter detection in naturalistic dyadic conversations

Files

Collections

Publication:
Audio-facial laughter detection in naturalistic dyadic conversations