When to Fold'em: How to answer Unanswerable questions

Marshall Ho; Zhipeng Zhou; Judith He

arXiv:2105.00328·cs.CL·May 4, 2021

When to Fold'em: How to answer Unanswerable questions

Marshall Ho, Zhipeng Zhou, Judith He

PDF

Open Access 1 Repo

TL;DR

This paper compares three question-answering models trained on SQuAD2.0, introduces a novel fine-tuning approach that improves F1 scores by 2% with less training, and highlights the effectiveness of re-initializing specific model layers.

Contribution

A new fine-tuning method involving re-initializing select layers of a shared language model, leading to improved performance and reduced training time.

Findings

01

Achieved a 2% increase in SQuAD2.0 F1 score.

02

Demonstrated the effectiveness of re-initializing layers in pre-trained models.

03

Compared BIDAF, DocumentQA, and ALBERT Retro-Reader models.

Abstract

We present 3 different question-answering models trained on the SQuAD2.0 dataset -- BIDAF, DocumentQA and ALBERT Retro-Reader -- demonstrating the improvement of language models in the past three years. Through our research in fine-tuning pre-trained models for question-answering, we developed a novel approach capable of achieving a 2% point improvement in SQuAD2.0 F1 in reduced training time. Our method of re-initializing select layers of a parameter-shared language model is simple yet empirically powerful.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/document-qa
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsMulti-Head Attention · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Softmax · WordPiece · Dense Connections · LAMB · Attention Is All You Need · Residual Connection