Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Reading Comprehension
Linjuan Wu, Shaojuan Wu, Xiaowang Zhang, Deyi Xiong, Shizhan Chen,, Zhiqiang Zhuang, Zhiyong Feng

TL;DR
This paper introduces a novel framework with a Siamese Semantic Disentanglement Model to improve zero-shot cross-lingual transfer in multilingual machine reading comprehension by separating semantic and syntactic information.
Contribution
It proposes a new semantic disentanglement approach with tailored losses to enhance zero-shot transfer in multilingual MRC tasks.
Findings
Outperforms mBERT and XLM-100 based models on XQuAD, MLQA, and TyDi QA datasets.
Effectively disentangles semantics from syntax in multilingual representations.
Improves answer span accuracy in low-resource languages.
Abstract
Multilingual pre-trained models are able to zero-shot transfer knowledge from rich-resource to low-resource languages in machine reading comprehension (MRC). However, inherent linguistic discrepancies in different languages could make answer spans predicted by zero-shot transfer violate syntactic constraints of the target language. In this paper, we propose a novel multilingual MRC framework equipped with a Siamese Semantic Disentanglement Model (SSDM) to disassociate semantics from syntax in representations learned by multilingual pre-trained models. To explicitly transfer only semantic knowledge to the target language, we propose two groups of losses tailored for semantic and syntactic encoding and disentanglement. Experimental results on three multilingual MRC datasets (i.e., XQuAD, MLQA, and TyDi QA) demonstrate the effectiveness of our proposed approach over models based on mBERT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsmBERT
