Enhancing Answer Boundary Detection for Multilingual Machine Reading   Comprehension

Fei Yuan; Linjun Shou; Xuanyu Bai; Ming Gong; Yaobo Liang; Nan Duan,; Yan Fu; Daxin Jiang

arXiv:2004.14069·cs.CL·May 11, 2020·6 cites

Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension

Fei Yuan, Linjun Shou, Xuanyu Bai, Ming Gong, Yaobo Liang, Nan Duan,, Yan Fu, Daxin Jiang

PDF

Open Access

TL;DR

This paper introduces auxiliary tasks during fine-tuning of multilingual models to improve answer boundary detection in cross-lingual machine reading comprehension, leveraging cross-lingual data and knowledge masking.

Contribution

It proposes two novel auxiliary tasks for fine-tuning that enhance answer boundary detection in multilingual MRC models, especially for low-resource languages.

Findings

01

Significant improvement in cross-lingual MRC performance

02

Effective use of phrase boundary supervision techniques

03

Enhanced transferability across languages

Abstract

Multilingual pre-trained models could leverage the training data from a rich source language (such as English) to improve performance on low resource languages. However, the transfer quality for multilingual Machine Reading Comprehension (MRC) is significantly worse than sentence classification tasks mainly due to the requirement of MRC to detect the word level answer boundary. In this paper, we propose two auxiliary tasks in the fine-tuning stage to create additional phrase boundary supervision: (1) A mixed MRC task, which translates the question or passage to other languages and builds cross-lingual question-passage pairs; (2) A language-agnostic knowledge masking task by leveraging knowledge phrases mined from web. Besides, extensive experiments on two cross-lingual MRC datasets show the effectiveness of our proposed approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications