Multilingual Neural Semantic Parsing for Low-Resourced Languages

Menglin Xia; Emilio Monti

arXiv:2106.03469·cs.CL·June 15, 2021

Multilingual Neural Semantic Parsing for Low-Resourced Languages

Menglin Xia, Emilio Monti

PDF

Open Access 1 Repo

TL;DR

This paper presents a multilingual semantic parsing approach for low-resource languages using machine translation and transfer learning, demonstrating improved performance and establishing new baselines on multiple datasets.

Contribution

It introduces a novel multilingual training method leveraging machine translation and pretrained encoders, along with a new dataset for evaluation.

Findings

01

Joint multilingual training outperforms baselines on TOP dataset.

02

The approach surpasses state-of-the-art on NLMaps.

03

Zero-shot performance on Italian reaches 44.9% accuracy.

Abstract

Multilingual semantic parsing is a cost-effective method that allows a single model to understand different languages. However, researchers face a great imbalance of availability of training data, with English being resource rich, and other languages having much less data. To tackle the data limitation problem, we propose using machine translation to bootstrap multilingual training data from the more abundant English data. To compensate for the data quality of machine translated training data, we utilize transfer learning from pretrained multilingual encoders to further improve the model. To evaluate our multilingual models on human-written sentences as opposed to machine translated ones, we introduce a new multilingual semantic parsing dataset in English, Italian and Japanese based on the Facebook Task Oriented Parsing (TOP) dataset. We show that joint multilingual training with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

awslabs/multilingual-top
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications