Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

Publication:
Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.department	KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.kuauthor	Bayramoğlu, Öykü Zeynep
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Sezgin, Tevfik Metin
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstitute	Research Center
dc.date.accessioned	2024-11-09T13:56:20Z
dc.date.issued	2021
dc.description.abstract	We propose a speech-driven laughter backchannel generation model to reward engagement during human-agent interaction. We formulate the problem as a Markov decision process where speech signal represents the state and the objective is to maximize human engagement. Since online training is often impractical in the case of human-agent interaction, we utilize the existing human-to-human dyadic interaction datasets to train our agent for the backchannel generation task. We address the problem using an actor-critic method based on conservative Q-learning (CQL), that mitigates the distributional shift problem by suppressing Q-value over-estimation during training. The proposed CQL based approach is evaluated objectively on the IEMOCAP dataset for laughter generation task. When compared to the existing off-policy Q-learning methods, we observe an improved compliance with the dataset in terms of laugh generation rate. Furthermore, we show the effectiveness of the learned policy by estimating the expected engagement using off-policy policy evaluation techniques.
dc.description.fulltext	YES
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	TÜBİTAK
dc.description.sponsorship	Scientific and Technological Research Council of Turkey (TÜBİTAK)
dc.description.version	Author's final manuscript
dc.identifier.doi	10.1145/3462244.3479944
dc.identifier.embargo	NO
dc.identifier.filenameinventoryno	IR03356
dc.identifier.isbn	978-1-4503-8481-0
dc.identifier.quartile	N/A
dc.identifier.scopus	2-s2.0-85119021073
dc.identifier.uri	https://doi.org/10.1145/3462244.3479944
dc.keywords	Backchannels
dc.keywords	Human-agent interaction
dc.keywords	Offline reinforcement learning
dc.keywords	User engagement
dc.language.iso	eng
dc.publisher	Association for Computing Machinery (ACM)
dc.relation.grantno	2.17E+42
dc.relation.ispartof	International Conference on Multimodal Interaction
dc.relation.uri	http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/10144
dc.subject	Generation
dc.title	Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Bayramoğlu, Öykü Zeynep
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Sezgin, Tevfik Metin
local.contributor.kuauthor	Yemez, Yücel
local.publication.orgunit1	College of Engineering
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	Research Center
local.publication.orgunit2	KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication	77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication	d437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 10144.pdf
Size:: 839.18 KB
Format:: Adobe Portable Document Format

Download

Collections

Publications with Fulltext

Publication: Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

Files

Original bundle

Collections

Publication:
Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation