MaP: A Matrix-based Prediction Approach to Improve Span Extraction in   Machine Reading Comprehension

Huaishao Luo; Yu Shi; Ming Gong; Linjun Shou; Tianrui Li

arXiv:2009.14348·cs.CL·October 1, 2020

MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension

Huaishao Luo, Yu Shi, Ming Gong, Linjun Shou, Tianrui Li

PDF

Open Access

TL;DR

This paper introduces MaP, a novel matrix-based span prediction method for machine reading comprehension that improves answer span extraction accuracy by covering more start-end pairs and employs a sampling strategy to manage computational costs.

Contribution

The paper presents a new probability matrix approach for span prediction and a sampling-based training strategy, enhancing existing models like BERT and BiDAF.

Findings

01

Consistent performance improvements on SQuAD 1.1 and other benchmarks.

02

Effective in covering more start-end span pairs.

03

Compatible with state-of-the-art models.

Abstract

Span extraction is an essential problem in machine reading comprehension. Most of the existing algorithms predict the start and end positions of an answer span in the given corresponding context by generating two probability vectors. In this paper, we propose a novel approach that extends the probability vector to a probability matrix. Such a matrix can cover more start-end position pairs. Precisely, to each possible start index, the method always generates an end probability vector. Besides, we propose a sampling-based training strategy to address the computational cost and memory issue in the matrix training phase. We evaluate our method on SQuAD 1.1 and three other question answering benchmarks. Leveraging the most competitive models BERT and BiDAF as the backbone, our proposed approach can get consistent improvements in all datasets, demonstrating the effectiveness of the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsLinear Layer · Adam · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Dropout · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout · WordPiece