Self-play for Data Efficient Language Acquisition
Charles Lovering, Ellie Pavlick

TL;DR
This paper introduces a self-play approach to improve data-efficient language learning in agents by exploiting the symmetry in communication roles, enabling better understanding and generation with limited supervision.
Contribution
It demonstrates that self-play can effectively transfer knowledge between understanding and generating language, reducing the need for extensive oracle interactions.
Findings
Self-play improves language understanding and generation in agents.
Agents trained with self-play transfer skills across roles.
Enhanced language acquisition with minimal supervision.
Abstract
When communicating, people behave consistently across conversational roles: People understand the words they say and are able to produce the words they hear. To date, artificial agents developed for language tasks have lacked such symmetry, meaning agents trained to produce language are unable to understand it and vice-versa. In this work, we exploit the symmetric nature of communication in order to improve both the efficiency and quality of language acquisition in learning agents. Specifically, we consider the setting in which an agent must learn to both understand and generate words in an existing language, but with the assumption that access to interaction with "oracle" speakers of the language is very limited. We show that using self-play as a substitute for direct supervision enables the agent to transfer its knowledge across roles (e.g. training as a listener but testing as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Multimodal Machine Learning Applications · Natural Language Processing Techniques
