Improving Span-based Question Answering Systems with Coarsely Labeled   Data

Hao Cheng; Ming-Wei Chang; Kenton Lee; Ankur Parikh; Michael Collins,; Kristina Toutanova

arXiv:1811.02076·cs.CL·November 7, 2018·1 cites

Improving Span-based Question Answering Systems with Coarsely Labeled Data

Hao Cheng, Ming-Wei Chang, Kenton Lee, Ankur Parikh, Michael Collins,, Kristina Toutanova

PDF

Open Access

TL;DR

This paper enhances span-based question answering systems by effectively leveraging coarse-grained paragraph relevance data through latent-variable modeling and novel training objectives, leading to significant performance improvements.

Contribution

It introduces explicit latent-variable models and posterior distillation methods to better utilize coarse annotations in fine-grained QA systems, surpassing standard multi-task learning approaches.

Findings

01

Latent-variable models outperform standard multi-task learning.

02

Explicit modeling of fine-coarse relationships yields larger gains.

03

Coarse supervision significantly improves QA accuracy.

Abstract

We study approaches to improve fine-grained short answer Question Answering models by integrating coarse-grained data annotated for paragraph-level relevance and show that coarsely annotated data can bring significant performance gains. Experiments demonstrate that the standard multi-task learning approach of sharing representations is not the most effective way to leverage coarse-grained annotations. Instead, we can explicitly model the latent fine-grained short answer variables and optimize the marginal log-likelihood directly or use a newly proposed \emph{posterior distillation} learning objective. Since these latent-variable methods have explicit access to the relationship between the fine and coarse tasks, they result in significantly larger improvements from coarse supervision.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems