Reinforced Natural Language Interfaces via Entropy Decomposition

Xiaoran Wu; Yipeng Kang

arXiv:2109.11408·cs.HC·January 8, 2024

Reinforced Natural Language Interfaces via Entropy Decomposition

Xiaoran Wu, Yipeng Kang

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach that decomposes language uncertainty into structural and functional components, enabling conversational agents to adapt to new tasks and learn effective communication protocols efficiently.

Contribution

It proposes a novel entropy decomposition method combined with reinforcement learning for adaptive natural language interfaces, improving task-specific communication.

Findings

01

Effective adaptation to unseen tasks demonstrated in experiments

02

Agents learn succinct, helpful communication protocols

03

Method outperforms baseline approaches in test scenarios

Abstract

In this paper, we study the technical problem of developing conversational agents that can quickly adapt to unseen tasks, learn task-specific communication tactics, and help listeners finish complex, temporally extended tasks. We find that the uncertainty of language learning can be decomposed to an entropy term and a mutual information term, corresponding to the structural and functional aspect of language, respectively. Combined with reinforcement learning, our method automatically requests human samples for training when adapting to new tasks and learns communication protocols that are succinct and helpful for task completion. Human and simulation test results on a referential game and a 3D navigation game prove the effectiveness of the proposed method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Reinforcement Learning in Robotics