Publication: Batch recurrent Q-Learning for backchannel generation towards engaging agents
dc.contributor.coauthor | N/A | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Hussain, Nusrah | |
dc.contributor.kuauthor | Erzin, Engin | |
dc.contributor.kuauthor | Sezgin, Tevfik Metin | |
dc.contributor.kuauthor | Yemez, Yücel | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Computer Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 34503 | |
dc.contributor.yokid | 18632 | |
dc.contributor.yokid | 107907 | |
dc.date.accessioned | 2024-11-09T23:07:57Z | |
dc.date.issued | 2019 | |
dc.description.abstract | The ability of an agent to generate appropriate verbal and nonverbal backchannels during human-robot interaction greatly enhances the interaction experience. Backchannels are particularly important in applications like tutoring and counseling, which require the user's constant attention and engagement. We present a method for training a robot to generate backchannels during human-robot interaction within the reinforcement learning (RL) framework, with the goal of maintaining a high engagement level. Since online learning by interaction with a human is highly time-consuming and impractical, we take advantage of a recorded human-to-human dataset and approach our problem as a batch reinforcement learning problem. The dataset is treated as batch data acquired by some behavior policy. We perform experiments with laughs as a backchannel and train an agent with value-based techniques. In particular, we demonstrate the effectiveness of recurrent layers in the approximate value function for this problem, which boost performance in partially observable environments. With off-policy policy evaluation, we show that the RL agents are expected to produce more engagement than an agent trained with imitation learning. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.identifier.doi | 10.1109/ACII.2019.8925443 | |
dc.identifier.isbn | 978-1-7281-3888-6 | |
dc.identifier.link | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85077800470&doi=10.1109%2fACII.2019.8925443&partnerID=40&md5=bd33450a13412b555157995e032884e0 | |
dc.identifier.scopus | 2-s2.0-85077800470 | |
dc.identifier.uri | http://dx.doi.org/10.1109/ACII.2019.8925443 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/9236 | |
dc.identifier.wos | 522220800058 | |
dc.keywords | Batch reinforcement learning | |
dc.keywords | Engagement | |
dc.keywords | Human-robot interaction | |
dc.keywords | Partially observable Markov decision process | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.source | 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019 | |
dc.subject | Computer science | |
dc.subject | Artificial intelligence | |
dc.subject | Information systems | |
dc.subject | Engineering | |
dc.subject | Electrical and electronic engineering | |
dc.title | Batch recurrent Q-Learning for backchannel generation towards engaging agents | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | 0000-0001-8786-1871 | |
local.contributor.authorid | 0000-0002-2715-2368 | |
local.contributor.authorid | 0000-0002-1524-1646 | |
local.contributor.authorid | 0000-0002-7515-3138 | |
local.contributor.kuauthor | Hussain, Nusrah | |
local.contributor.kuauthor | Erzin, Engin | |
local.contributor.kuauthor | Sezgin, Tevfik Metin | |
local.contributor.kuauthor | Yemez, Yücel | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae |