Multilingual Neural Semantic Parsing for Low-Resourced Languages
Menglin Xia, Emilio Monti

TL;DR
This paper presents a multilingual semantic parsing approach for low-resource languages using machine translation and transfer learning, demonstrating improved performance and establishing new baselines on multiple datasets.
Contribution
It introduces a novel multilingual training method leveraging machine translation and pretrained encoders, along with a new dataset for evaluation.
Findings
Joint multilingual training outperforms baselines on TOP dataset.
The approach surpasses state-of-the-art on NLMaps.
Zero-shot performance on Italian reaches 44.9% accuracy.
Abstract
Multilingual semantic parsing is a cost-effective method that allows a single model to understand different languages. However, researchers face a great imbalance of availability of training data, with English being resource rich, and other languages having much less data. To tackle the data limitation problem, we propose using machine translation to bootstrap multilingual training data from the more abundant English data. To compensate for the data quality of machine translated training data, we utilize transfer learning from pretrained multilingual encoders to further improve the model. To evaluate our multilingual models on human-written sentences as opposed to machine translated ones, we introduce a new multilingual semantic parsing dataset in English, Italian and Japanese based on the Facebook Task Oriented Parsing (TOP) dataset. We show that joint multilingual training with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
