Publication: Audio-facial laughter detection in naturalistic dyadic conversations
dc.contributor.coauthor | N/A | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Türker, Bekir Berker | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.kuauthor | Sezgin, Tevfik Metin | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 107907 | |
dc.contributor.yokid | 18632 | |
dc.contributor.yokid | 34503 | |
dc.date.accessioned | 2024-11-10T00:12:08Z | |
dc.date.issued | 2017 | |
dc.description.abstract | We address the problem of continuous laughter detection over audio-facial input streams obtained from naturalistic dyadic conversations. We first present meticulous annotation of laughters, cross-talks and environmental noise in an audio-facial database with explicit 3D facial mocap data. Using this annotated database, we rigorously investigate the utility of facial information, head movement and audio features for laughter detection. We identify a set of discriminative features using mutual information-based criteria, and show how they can be used with classifiers based on support vector machines (SVMs) and time delay neural networks (TDNNs). Informed by the analysis of the individual modalities, we propose a multimodal fusion setup for laughter detection using different classifier-feature combinations. We also effectively incorporate bagging into our classification pipeline to address the class imbalance problem caused by the scarcity of positive laughter instances. Our results indicate that a combination of TDNNs and SVMs lead to superior detection performance, and bagging effectively addresses data imbalance. Our experiments show that our multimodal approach supported by bagging compares favorably to the state of the art in presence of detrimental factors such as cross-talk, environmental noise, and data imbalance. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.issue | 4 | |
dc.description.openaccess | NO | |
dc.description.sponsorship | ERA-Net CHIST-ERA under JOKER | |
dc.description.sponsorship | Turkish Scientific and Technical Research Council (TUBITAK) [113E324] This work is supported by ERA-Net CHIST-ERA under the JOKER project and Turkish Scientific and Technical Research Council (TUBITAK) under grant number 113E324. | |
dc.description.volume | 8 | |
dc.identifier.doi | 10.1109/TAFFC.2017.2754256 | |
dc.identifier.issn | 1949-3045 | |
dc.identifier.scopus | 2-s2.0-85030642834 | |
dc.identifier.uri | http://dx.doi.org/10.1109/TAFFC.2017.2754256 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/17595 | |
dc.identifier.wos | 417921000011 | |
dc.keywords | Laughter detection | |
dc.keywords | Naturalistic dyadic conversations | |
dc.keywords | Facial mocap | |
dc.keywords | Data imbalance | |
dc.keywords | Speech | |
dc.language | English | |
dc.publisher | Ieee-Inst Electrical Electronics Engineers Inc | |
dc.source | Ieee Transactions On Affective Computing | |
dc.subject | Computer science | |
dc.subject | Artificial intelligence | |
dc.subject | Computer science | |
dc.subject | Cybernetics | |
dc.title | Audio-facial laughter detection in naturalistic dyadic conversations | |
dc.type | Journal Article | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0002-7515-3138 | |
local.contributor.authorid | 0000-0002-1524-1646 | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.kuauthor | Türker, Bekir Berker | |
local.contributor.kuauthor | Yemez, Yücel | |
local.contributor.kuauthor | Sezgin, Tevfik Metin | |
local.contributor.kuauthor | Erzin, Engin | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |