Learning with Instance Bundles for Reading Comprehension

Dheeru Dua; Pradeep Dasigi; Sameer Singh; Matt Gardner

arXiv:2104.08735·cs.CL·April 20, 2021

Learning with Instance Bundles for Reading Comprehension

Dheeru Dua, Pradeep Dasigi, Sameer Singh, Matt Gardner

PDF

TL;DR

This paper introduces novel supervision techniques for reading comprehension models that leverage relationships among related question-answer pairs within bundles, improving accuracy significantly.

Contribution

It proposes new contrastive supervision methods using instance bundles, enhancing training signals beyond traditional maximum likelihood estimation.

Findings

01

Up to 11% accuracy improvement on HotpotQA.

02

Effective use of related question-answer bundles for training.

03

Demonstrated benefits on ROPES dataset.

Abstract

When training most modern reading comprehension models, all the questions associated with a context are treated as being independent from each other. However, closely related questions and their corresponding answers are not independent, and leveraging these relationships could provide a strong supervision signal to a model. Drawing on ideas from contrastive estimation, we introduce several new supervision techniques that compare question-answer scores across multiple related instances. Specifically, we normalize these scores across various neighborhoods of closely contrasting questions and/or answers, adding another cross entropy loss term that is used in addition to traditional maximum likelihood estimation. Our techniques require bundles of related question-answer pairs, which we can either mine from within existing data or create using various automated heuristics. We empirically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.