Rethinking the Objectives of Extractive Question Answering

Martin Fajcik; Josef Jon; Pavel Smrz

arXiv:2008.12804 · cs.CL · October 13, 2021

TL;DR

This paper challenges the independence assumption in extractive question answering models and introduces direct modelling of the joint span probability with a compound objective, which matches or improves exact match across multiple models and datasets.

Contribution

It proposes modelling the joint span probability directly, using a compound objective that keeps the independence-based objective as an auxiliary term and is consistently superior or equal to the other objectives in exact match.

Findings

01 The compound objective matches or improves exact match scores.

02 The independence assumption causes common errors.

03 The method is effective across multiple models and datasets.
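
One way to see how the independence assumption can produce errors is a toy case. It is constructed here for illustration and is not taken from the paper. Suppose a passage contains two equally good answer mentions, the spans $(1, 2)$ and $(4, 5)$, and the joint distribution puts half of its mass on each:

$$
P(a_{s}{=}1, a_{e}{=}2) = P(a_{s}{=}4, a_{e}{=}5) = \tfrac{1}{2}
\quad\Rightarrow\quad
P(a_{s}{=}1) = P(a_{s}{=}4) = P(a_{e}{=}2) = P(a_{e}{=}5) = \tfrac{1}{2}
$$

$$
P(a_{s}{=}1)\, P(a_{e}{=}5) = \tfrac{1}{4} = P(a_{s}{=}1)\, P(a_{e}{=}2)
\qquad \text{while} \qquad
P(a_{s}{=}1, a_{e}{=}5) = 0
$$

Under the factorization $P (a_{s}, a_{e}) = P (a_{s}) P (a_{e})$, the overlong span $(1, 5)$, which runs from the start of the first mention to the end of the second, scores exactly as high as either correct span, although the joint distribution gives it zero probability. A model of the joint probability can represent this distinction; a product of marginals cannot.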

Abstract

This work demonstrates that using an objective with the independence assumption for modelling the span probability $P (a_{s}, a_{e}) = P (a_{s}) P (a_{e})$ of a span starting at position $a_{s}$ and ending at position $a_{e}$ has adverse effects. Therefore, we propose multiple approaches to modelling the joint probability $P (a_{s}, a_{e})$ directly. Among those, we propose a compound objective, composed of the joint probability while still keeping the objective with the independence assumption as an auxiliary objective. We find that the compound objective is consistently superior or equal to other assumptions in exact match. Additionally, we identify common errors caused by the assumption of independence and manually check the counterpart predictions, demonstrating the impact of the compound objective on real examples. Our findings are supported via experiments with three extractive QA models (BIDAF, BERT,…
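
The three objectives named in the abstract can be sketched as negative log-likelihoods over toy scores. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the joint softmax is restricted to spans with start not after end, the span scores are simply given as a matrix (how a model would produce them is left open), and the compound loss is taken to be an unweighted sum of the joint and independent terms.

```python
import math


def log_softmax(scores):
    """Numerically stable log-softmax over a flat list of scores."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def independent_nll(start_scores, end_scores, a_s, a_e):
    """-log P(a_s) - log P(a_e): two separate softmaxes over positions."""
    return -(log_softmax(start_scores)[a_s] + log_softmax(end_scores)[a_e])


def joint_nll(pair_scores, a_s, a_e):
    """-log P(a_s, a_e): one softmax over all spans with start <= end.

    Assumption: pair_scores[i][j] is a model-provided score for span (i, j).
    Raises ValueError if the gold span has a_s > a_e.
    """
    n = len(pair_scores)
    spans = [(i, j) for i in range(n) for j in range(i, n)]
    log_probs = log_softmax([pair_scores[i][j] for i, j in spans])
    return -log_probs[spans.index((a_s, a_e))]


def compound_nll(start_scores, end_scores, pair_scores, a_s, a_e):
    """Joint term plus the independent term as an auxiliary loss.

    Assumption: an unweighted sum of the two terms.
    """
    return (joint_nll(pair_scores, a_s, a_e)
            + independent_nll(start_scores, end_scores, a_s, a_e))


if __name__ == "__main__":
    start = [2.0, 0.0, 1.0]
    end = [0.0, 2.0, 1.0]
    # Toy span scores; a real model would compute these from its encodings.
    pairs = [[s + e for e in end] for s in start]
    print(compound_nll(start, end, pairs, a_s=0, a_e=1))
```

With uniform scores over three positions, the joint term normalizes over the six valid spans while the independent term normalizes over three starts and three ends separately, which makes the difference between the two normalizations easy to check by hand.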

Code & Models

Repositories

KNOT-FIT-BUT/JointSpanExtraction (PyTorch, official)

Taxonomy

Methods: Linear Layer · Residual Connection · Softmax · Dense Connections · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout · Attention Is All You Need · Adam