Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling
Jitin Krishnan, Antonios Anastasopoulos, Hemant Purohit, and Huzefa, Rangwala

TL;DR
This paper introduces a novel multilingual code-switching data augmentation method to improve zero-shot cross-lingual intent prediction and slot filling, demonstrating significant accuracy gains across multiple languages and real-world crisis data.
Contribution
The study proposes a new data augmentation technique using multilingual code-switching to enhance language neutrality in transformers for zero-shot NLU tasks.
Findings
Achieved +4.2% accuracy in intent prediction across 8 languages.
Improved F1 score by +1.8% for slot filling.
Validated effectiveness on a new Haitian Creole-English dataset during a disaster.
Abstract
Predicting user intent and detecting the corresponding slots from text are two key problems in Natural Language Understanding (NLU). In the context of zero-shot learning, this task is typically approached by either using representations from pre-trained multilingual transformers such as mBERT, or by machine translating the source data into the known target language and then fine-tuning. Our work focuses on a particular scenario where the target language is unknown during training. To this goal, we propose a novel method to augment the monolingual source data using multilingual code-switching via random translations to enhance a transformer's language neutrality when fine-tuning it for a downstream task. This method also helps discover novel insights on how code-switching with different language families around the world impact the performance on the target language. Experiments on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Natural Language Processing Techniques
MethodsmBERT
