Using calibrator to improve robustness in Machine Reading Comprehension

Jing Jin; Houfeng Wang

arXiv:2202.11865·cs.CL·February 25, 2022

Using calibrator to improve robustness in Machine Reading Comprehension

Jing Jin, Houfeng Wang

PDF

Open Access

TL;DR

This paper introduces a post-hoc calibrator based on XGBoost to enhance the robustness of Machine Reading Comprehension models against adversarial attacks and data shifts, improving performance across multiple datasets.

Contribution

It proposes a novel calibrator method that combines manual and learned features for reranking, improving robustness without degrading original dataset performance.

Findings

01

Performance improved by over 10% on adversarial datasets

02

Enhanced robustness without loss on original data

03

Effective reranking method for MRC models

Abstract

Machine Reading Comprehension(MRC) has achieved a remarkable result since some powerful models, such as BERT, are proposed. However, these models are not robust enough and vulnerable to adversarial input perturbation and generalization examples. Some works tried to improve the performance on specific types of data by adding some related examples into training data while it leads to degradation on the original dataset, because the shift of data distribution makes the answer ranking based on the softmax probability of model unreliable. In this paper, we propose a method to improve the robustness by using a calibrator as the post-hoc reranker, which is implemented based on XGBoost model. The calibrator combines both manual features and representation learning features to rerank candidate results. Experimental results on adversarial datasets show that our model can achieve performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Weight Decay · Linear Warmup With Linear Decay · WordPiece · Layer Normalization · Adam