MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao Luo, Yu Shi, Ming Gong, Linjun Shou, Tianrui Li

TL;DR
This paper introduces MaP, a novel matrix-based span prediction method for machine reading comprehension that improves answer span extraction accuracy by covering more start-end pairs and employs a sampling strategy to manage computational costs.
Contribution
The paper presents a new probability matrix approach for span prediction and a sampling-based training strategy, enhancing existing models like BERT and BiDAF.
Findings
Consistent performance improvements on SQuAD 1.1 and other benchmarks.
Effective in covering more start-end span pairs.
Compatible with state-of-the-art models.
Abstract
Span extraction is an essential problem in machine reading comprehension. Most of the existing algorithms predict the start and end positions of an answer span in the given corresponding context by generating two probability vectors. In this paper, we propose a novel approach that extends the probability vector to a probability matrix. Such a matrix can cover more start-end position pairs. Precisely, to each possible start index, the method always generates an end probability vector. Besides, we propose a sampling-based training strategy to address the computational cost and memory issue in the matrix training phase. We evaluate our method on SQuAD 1.1 and three other question answering benchmarks. Leveraging the most competitive models BERT and BiDAF as the backbone, our proposed approach can get consistent improvements in all datasets, demonstrating the effectiveness of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsLinear Layer · Adam · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Dropout · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout · WordPiece
