Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
Evgeniia Razumovskaia, Goran Glava\v{s}, Olga Majewska, Edoardo M., Ponti, Anna Korhonen, Ivan Vuli\'c

TL;DR
This paper reviews the current state of multilingual task-oriented dialogue systems, highlighting data scarcity, reliance on transfer learning, and challenges in expanding language coverage and evaluation methods.
Contribution
It provides an extensive overview of methods, resources, and challenges in multilingual ToD, proposing parallels with other NLP tasks to inspire solutions for low-resource languages.
Findings
Data scarcity limits multilingual ToD development.
Current approaches rely on cross-lingual transfer from resource-rich languages.
Evaluation benchmarks for diverse languages are lacking.
Abstract
In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent to complete a concrete task. Although this technology represents one of the central objectives of AI and has been the focus of ever more intense research and development efforts, it is currently limited to a few narrow domains (e.g., food ordering, ticket booking) and a handful of languages (e.g., English, Chinese). This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field. We find that the most critical factor preventing the creation of truly multilingual ToD systems is the lack of datasets in most languages for both training and evaluation. In fact, acquiring annotations or human feedback for each component of modular systems or for data-hungry end-to-end systems is expensive and tedious. Hence,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Natural Language Processing Techniques
