On the Robustness of Reading Comprehension Models to Entity Renaming

Jun Yan; Yang Xiao; Sagnik Mukherjee; Bill Yuchen Lin; Robin Jia,; Xiang Ren

arXiv:2110.08555·cs.CL·May 5, 2022·1 cites

On the Robustness of Reading Comprehension Models to Entity Renaming

Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia,, Xiang Ren

PDF

Open Access 1 Repo

TL;DR

This paper investigates how machine reading comprehension models' accuracy is affected by renaming entities in questions, revealing their reliance on entity names and proposing methods to improve robustness.

Contribution

It introduces a scalable pipeline to test model robustness to entity renaming and compares different models and training strategies for improved resilience.

Findings

01

Models perform worse when entities are renamed across datasets.

02

SpanBERT is more robust than RoBERTa despite similar baseline accuracy.

03

Entity-based masking during pretraining enhances model robustness.

Abstract

We study the robustness of machine reading comprehension (MRC) models to entity renaming -- do models make more wrong predictions when the same questions are asked about an entity whose name has been changed? Such failures imply that models overly rely on entity information to answer questions, and thus may generalize poorly when facts about the world change or questions are asked about novel entities. To systematically audit this issue, we present a pipeline to automatically generate test examples at scale, by replacing entity names in the original test sample with names from a variety of sources, ranging from names in the same test set, to common names in life, to arbitrary strings. Across five datasets and three pretrained model architectures, MRC models consistently perform worse when entities are renamed, with particularly large accuracy drops on datasets constructed via distant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ink-usc/entity-robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies

MethodsAttention Is All You Need · Test · Linear Layer · Weight Decay · Softmax · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia?