Batch recurrent Q-Learning for backchannel generation towards engaging agents

Publication:
Batch recurrent Q-Learning for backchannel generation towards engaging agents

dc.contributor.coauthor	N/A
dc.contributor.department	Department of Computer Engineering
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Erzin, Engin
dc.contributor.kuauthor	Hussain, Nusrah
dc.contributor.kuauthor	Sezgin, Tevfik Metin
dc.contributor.kuauthor	Yemez, Yücel
dc.contributor.schoolcollegeinstitute	College of Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned	2024-11-09T23:07:57Z
dc.date.issued	2019
dc.description.abstract	The ability to generate appropriate verbal and nonverbal backchannels by an agent during human-robot interaction greatly enhances the interaction experience. Backchannels are particularly important in applications like tutoring and counseling, which require constant attention and engagement of the user. We present here a method for training a robot for backchannel generation during a human-robot interaction within the reinforcement learning (RL) framework, with the goal of maintaining high engagement level. Since online learning by interaction with a human is highly time-consuming and impractical, we take advantage of the recorded human-to-human dataset and approach our problem as a batch reinforcement learning problem. The dataset is utilized as a batch data acquired by some behavior policy. We perform experiments with laughs as a backchannel and train an agent with value-based techniques. In particular, we demonstrate the effectiveness of recurrent layers in the approximate value function for this problem, that boosts the performance in partially observable environments. With off-policy policy evaluation, it is shown that the RL agents are expected to produce more engagement than an agent trained from imitation learning.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.openaccess	YES
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	N/A
dc.identifier.doi	10.1109/ACII.2019.8925443
dc.identifier.isbn	9781-7281-3888-6
dc.identifier.scopus	2-s2.0-85077800470
dc.identifier.uri	https://doi.org/10.1109/ACII.2019.8925443
dc.identifier.uri	https://hdl.handle.net/20.500.14288/9236
dc.identifier.wos	522220800058
dc.keywords	Batch reinforcement learning
dc.keywords	Engagement
dc.keywords	Human-robot interaction
dc.keywords	Partially observable
dc.keywords	Markov decision process
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof	2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019
dc.subject	Computer science
dc.subject	Artificial intelligence
dc.subject	Information systems
dc.subject	Engineering
dc.subject	Electrical and electronic engineering
dc.title	Batch recurrent Q-Learning for backchannel generation towards engaging agents
dc.type	Conference Proceeding
dspace.entity.type	Publication
local.contributor.kuauthor	Hussain, Nusrah
local.contributor.kuauthor	Erzin, Engin
local.contributor.kuauthor	Sezgin, Tevfik Metin
local.contributor.kuauthor	Yemez, Yücel
local.publication.orgunit1	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1	College of Engineering
local.publication.orgunit2	Department of Computer Engineering
local.publication.orgunit2	Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication	3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery	89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication	8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication	434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery	8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Collections

Publications without Fulltext

Publication: Batch recurrent Q-Learning for backchannel generation towards engaging agents

Files

Collections

Publication:
Batch recurrent Q-Learning for backchannel generation towards engaging agents