A Multilingual Modeling Method for Span-Extraction Reading Comprehension
Gaochen Wu, Bin Xu, Dejie Chang, Bangchang Liu

TL;DR
This paper introduces XLRC, a multilingual span-extraction reading comprehension model that leverages translated datasets and novel attention mechanisms to improve performance across languages with limited training data.
Contribution
The paper proposes a novel multilingual approach with self-adaptive and multilingual attention mechanisms, enabling transfer learning for reading comprehension in low-resource languages.
Findings
Outperforms RoBERTa_Large on CMRC 2018
Effective multilingual transfer learning
Enhances semantic relation extraction across languages
Abstract
Span-extraction reading comprehension models have made tremendous advances enabled by the availability of large-scale, high-quality training datasets. Despite such rapid progress and widespread application, extractive reading comprehension datasets in languages other than English remain scarce, and creating such a sufficient amount of training data for each language is costly and even impossible. An alternative to creating large-scale high-quality monolingual span-extraction training datasets is to develop multilingual modeling approaches and systems which can transfer to the target language without requiring training data in that language. In this paper, in order to solve the scarce availability of extractive reading comprehension training data in the target language, we propose a multilingual extractive reading comprehension approach called XLRC by simultaneously modeling the existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
