TL;DR
This paper investigates what neural dialog models learn internally, revealing their limitations in understanding and generating conversations, and emphasizes the need for improved architectures and training methods for better conversational skills.
Contribution
It provides an in-depth analysis of internal representations in neural dialog systems and identifies key areas where they underperform in conversational understanding.
Findings
Models struggle with answering questions and inferring contradictions.
Standard models do not effectively leverage turn-taking dynamics.
Highlighting the need for architectures capturing high-level dialog information.
Abstract
The predominant approach to open-domain dialog generation relies on end-to-end training of neural models on chat datasets. However, this approach provides little insight as to what these models learn (or do not learn) about engaging in dialog. In this study, we analyze the internal representations learned by neural open-domain dialog systems and evaluate the quality of these representations for learning basic conversational skills. Our results suggest that standard open-domain dialog systems struggle with answering questions, inferring contradiction, and determining the topic of conversation, among other tasks. We also find that the dyadic, turn-taking nature of dialog is not fully leveraged by these models. By exploring these limitations, we highlight the need for additional research into architectures and training methods that can better capture high-level information about dialog.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
