Department of Computer Engineering2024-11-1020169781-4799-9988-01520-614910.1109/ICASSP.2016.74721802-s2.0-84973402468http://dx.doi.org/10.1109/ICASSP.2016.7472180https://hdl.handle.net/20.500.14288/17464In human-to-human communication gesture and speech co-exist in time with a tight synchrony, where we tend to use gestures to complement or to emphasize speech. In this study, we investigate roles of vocal and gestural cues to identify a dyadic interaction as agreement and disagreement. In this investigation we use the JESTKOD database, which consists of speech and full-body motion capture data recordings for dyadic interactions under agreement and disagreement scenarios. Spectral features of vocal channel and upper body joint angles of gestural channel are employed to extract unimodal and multimodal classification performances. Both of the modalities attain classification rates significantly above the chance level and the multimodal classifier performed more than 80% classification rate over 15 second utterances using statistical features of speech and motion.AcousticsEngineeringElectrical and electronic engineeringAgreement and disagreement classification of dyadic interactions using vocal and gestural cuesConference proceedinghttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84973402468&doi=10.1109%2fICASSP.2016.7472180&partnerID=40&md5=d0caa94ef4d2aa4a681109cb21def0333883734021814643