Cross-Lingual Approaches to Reference Resolution in Dialogue Systems
Amr Sharaf, Arpit Gupta, Hancheng Ge, Chetan Naik, Lambert Mathias

TL;DR
This paper investigates cross-lingual methods for reference resolution in dialogue systems, aiming to reduce the need for annotated data in target languages by leveraging multilingual embeddings, delexicalization, and machine translation.
Contribution
It compares three cross-lingual transfer approaches for reference resolution, highlighting their effectiveness in low-resource and high-resource settings.
Findings
Multilingual embeddings and delexicalization improve low-resource performance.
Gains diminish as target language data increases.
Machine translation combined with these methods approaches ideal performance with minimal data.
Abstract
In the slot-filling paradigm, where a user can refer back to slots in the context during the conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In this paper, we build on the context carryover system~\citep{Naik2018ContextualSC}, which provides a scalable multi-domain framework for resolving references. However, scaling this approach across languages is not a trivial task, due to the large demand on acquisition of annotated data in the target language. Our main focus is on cross-lingual methods for reference resolution as a way to alleviate the need for annotated data in the target language. In the cross-lingual setup, we assume there is access to annotated resources as well as a well trained model in the source language and little to no annotated data in the target language. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
