Publication:
Use of affect context in dyadic interactions for continuous emotion recognition

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.facultymemberYes
dc.contributor.kuauthorFatima, Syeda Narjis
dc.contributor.kuauthorErzin, Engin
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-11-09T23:14:03Z
dc.date.issued2021
dc.description.abstractEmotional dependencies play a crucial role in understanding complexities of dyadic interactions. Recent studies have shown that affect recognition tasks can benefit by the incorporation of a particular interaction's context, however, the investigation of affect context in dyadic settings using neural network frameworks remains a complex and open problem. In this paper, we formulate the concept of dyadic affect context (DAC) and propose convolutional neural network (CNN) based architectures to model and incorporate DAC to improve continuous emotion recognition (CER) in dyadic scenarios. We begin by defining a CNN architecture for single-subject CER-based on speech and body motion data. We then introduce dyadic CER as a two-stage regression framework. Specifically, we propose two dyadic CNN architectures where cross-speaker affect contribution to the CER task is achieved by: (i) the fusion of cross-subject affect (FoA) or (ii) the fusion of cross-subject feature maps (FoM). Based on the preceding dyadic models, we finally propose a new Convolutional LSTM (ConvLSTM) model for the dyadic CER. ConvLSTM architecture captures local spectro-temporal correlations in speech and body motion as well as the long-term affect inter-dependencies between subjects. Our multimodal analysis demonstrates that modeling and incorporation of the DAC in the proposed CER models provide significant performance improvements on the USC CreativeIT database and the achieved results compare favorably to the state-of-the-art.
dc.description.fulltextNo
dc.description.harvestedfromManual
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessNO
dc.description.peerreviewstatusN/A
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.studentonlypublicationNo
dc.description.studentpublicationYes
dc.description.versionN/A
dc.identifier.doi10.1016/j.specom.2021.05.010
dc.identifier.eissn1872-7182
dc.identifier.embargoN/A
dc.identifier.endpage82
dc.identifier.issn0167-6393
dc.identifier.quartileQ1
dc.identifier.scopus2-s2.0-85107675361
dc.identifier.startpage70
dc.identifier.urihttps://doi.org/10.1016/j.specom.2021.05.010
dc.identifier.urihttps://hdl.handle.net/20.500.14288/10079
dc.identifier.volume132
dc.identifier.wos000680415400007
dc.keywordsDyadic interactions
dc.keywordsContinuous emotion recognition (CER)
dc.keywordsDyadic affect context (DAC)
dc.keywordsCNN
dc.keywordsConvLSTM
dc.language.isoeng
dc.publisherElsevier
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartofSpeech Communication
dc.relation.openaccessN/A
dc.rightsN/A
dc.subjectAcoustics
dc.subjectComputer science, interdisciplinary applications
dc.titleUse of affect context in dyadic interactions for continuous emotion recognition
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.kuauthorFatima, Syeda Narjis
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files