Towards Confident Machine Reading Comprehension

Rishav Chakravarti; Avirup Sil

arXiv:2101.07942·cs.CL·February 25, 2021·1 cites

Towards Confident Machine Reading Comprehension

Rishav Chakravarti, Avirup Sil

PDF

Open Access

TL;DR

This paper introduces Mr.C, a confidence estimation model for machine reading comprehension that improves the system's ability to avoid incorrect answers and better predict answerability, especially under domain shift conditions.

Contribution

The paper presents a novel confidence estimation approach using gradient-based features, enhancing answer correctness and answerability predictions in MRC systems.

Findings

01

Mr.C improves AUC scores by up to 4 points.

02

Mr.C enhances answerability prediction by 5 F1 points.

03

Gradient-based features contribute significantly to confidence estimation.

Abstract

There has been considerable progress on academic benchmarks for the Reading Comprehension (RC) task with State-of-the-Art models closing the gap with human performance on extractive question answering. Datasets such as SQuAD 2.0 & NQ have also introduced an auxiliary task requiring models to predict when a question has no answer in the text. However, in production settings, it is also necessary to provide confidence estimates for the performance of the underlying RC model at both answer extraction and "answerability" detection. We propose a novel post-prediction confidence estimation model, which we call Mr.C (short for Mr. Confident), that can be trained to improve a system's ability to refrain from making incorrect predictions with improvements of up to 4 points as measured by Area Under the Curve (AUC) scores. Mr.C can benefit from a novel white-box feature that leverages the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications