Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

Nikita Moghe; Mark Steedman; Alexandra Birch

arXiv:2109.13620·cs.CL·September 29, 2021

Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

Nikita Moghe, Mark Steedman, Alexandra Birch

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel cross-lingual intermediate fine-tuning method for pretrained multilingual models, significantly improving dialogue state tracking performance across multiple languages with minimal data and zero-shot learning.

Contribution

It proposes using parallel movie subtitles for intermediate fine-tuning of multilingual models, enhancing cross-lingual transfer for dialogue state tracking tasks.

Findings

01

Over 20% improvement in joint goal accuracy on MultiWoZ dataset

02

Effective with only 10% of target language data

03

Achieves zero-shot performance on Multilingual WoZ

Abstract

Recent progress in task-oriented neural dialogue systems is largely focused on a handful of languages, as annotation of training data is tedious and expensive. Machine translation has been used to make systems multilingual, but this can introduce a pipeline of errors. Another promising solution is using cross-lingual transfer learning through pretrained multilingual models. Existing methods train multilingual models with additional code-mixed task data or refine the cross-lingual representations through parallel ontologies. In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks. Specifically, we use parallel and conversational movie subtitles datasets to design cross-lingual intermediate tasks suitable for downstream dialogue tasks. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nikitacs16/xlift_dst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsTest