Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a   Distilled Representation

Mehrad Moradshahi; Sina J. Semnani; Monica S. Lam

arXiv:2302.09424·cs.CL·February 21, 2023

Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation

Mehrad Moradshahi, Sina J. Semnani, Monica S. Lam

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel end-to-end approach for zero and few-shot localization of task-oriented dialogue agents across languages, significantly reducing the need for target language training data while maintaining high accuracy.

Contribution

It presents methods to build high-quality dialogue agents in new languages using improved data representation, entity-aware translation, and filtering, outperforming prior cross-lingual approaches.

Findings

01

Zero-shot Task Success Rate: 46.7% in Chinese-English transfer.

02

Few-shot improvement: 15.2% over state-of-the-art.

03

Approach narrows gap to full-shot performance within 5%.

Abstract

Task-oriented Dialogue (ToD) agents are mostly limited to a few widely-spoken languages, mainly due to the high cost of acquiring training data for each language. Existing low-cost approaches that rely on cross-lingual embeddings or naive machine translation sacrifice a lot of accuracy for data efficiency, and largely fail in creating a usable dialogue agent. We propose automatic methods that use ToD training data in a source language to build a high-quality functioning dialogue agent in another target language that has no training data (i.e. zero-shot) or a small training set (i.e. few-shot). Unlike most prior work in cross-lingual ToD that only focuses on Dialogue State Tracking (DST), we build an end-to-end agent. We show that our approach closes the accuracy gap between few-shot and existing full-shot methods for ToD agents. We achieve this by (1) improving the dialogue data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stanford-oval/dialogues
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

Methodsfail