Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking
Nikita Moghe, Mark Steedman, Alexandra Birch

TL;DR
This paper introduces a novel cross-lingual intermediate fine-tuning method for pretrained multilingual models, significantly improving dialogue state tracking performance across multiple languages with minimal data and zero-shot learning.
Contribution
It proposes using parallel movie subtitles for intermediate fine-tuning of multilingual models, enhancing cross-lingual transfer for dialogue state tracking tasks.
Findings
Over 20% improvement in joint goal accuracy on MultiWoZ dataset
Effective with only 10% of target language data
Achieves zero-shot performance on Multilingual WoZ
Abstract
Recent progress in task-oriented neural dialogue systems is largely focused on a handful of languages, as annotation of training data is tedious and expensive. Machine translation has been used to make systems multilingual, but this can introduce a pipeline of errors. Another promising solution is using cross-lingual transfer learning through pretrained multilingual models. Existing methods train multilingual models with additional code-mixed task data or refine the cross-lingual representations through parallel ontologies. In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks. Specifically, we use parallel and conversational movie subtitles datasets to design cross-lingual intermediate tasks suitable for downstream dialogue tasks. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsTest
