Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

We propose a speech-driven laughter backchannel generation model to reward engagement during human-agent interaction. We formulate the problem as a Markov decision process where speech signal represents the state and the objective is to maximize human engagement. Since online training is often impractical in the case of human-agent interaction, we utilize the existing human-to-human dyadic interaction datasets to train our agent for the backchannel generation task. We address the problem using an actor-critic method based on conservative Q-learning (CQL), that mitigates the distributional shift problem by suppressing Q-value over-estimation during training. The proposed CQL based approach is evaluated objectively on the IEMOCAP dataset for laughter generation task. When compared to the existing off-policy Q-learning methods, we observe an improved compliance with the dataset in terms of laugh generation rate. Furthermore, we show the effectiveness of the learned policy by estimating the expected engagement using off-policy policy evaluation techniques.

Publisher

Association for Computing Machinery (ACM)

Subject

Generation

Source

International Conference on Multimodal Interaction

DOI

10.1145/3462244.3479944

URI

https://doi.org/10.1145/3462244.3479944

Collections

Publications with Fulltext

Full item page

1

Views

6

Downloads

View PlumX Details

Publication:
Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

1

Views

6

Downloads

Publication: Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation

Files

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Publication Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

1

Views

6

Downloads

Publication:
Engagement rewarded actor-critic with conservative Q-learning for speech-driven laughter backchannel generation