Making Neural QA as Simple as Possible but not Simpler

Dirk Weissenborn; Georg Wiese; Laura Seiffe

arXiv:1703.04816 · cs.CL · June 9, 2017 · 43 cites

Open Access · 3 Repos

TL;DR

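This paper shows that a simple neural baseline for extractive question answering can perform competitively with more complex models, provided it is aware of question words while processing the context and uses a composition function that goes beyond bag-of-words modeling, such as a recurrent neural network. The result calls into question whether the added complexity of existing systems is justified.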

Contribution

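The paper proposes a simple heuristic to guide the development of neural baseline systems for extractive QA: make the model aware of question words while it processes the context, and use a composition function that goes beyond bag-of-words modeling. FastQA, a system that meets both requirements, achieves very competitive performance compared with existing models.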

Findings

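01. FastQA achieves very competitive performance compared with existing extractive QA models.

02. A simple baseline that follows the proposed heuristic can rival more complex neural models.

03. Additional architectural complexity is not always necessary for high performance.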

Abstract

Recent development of large-scale question answering (QA) datasets triggered a substantial amount of research into end-to-end neural architectures for QA. Increasingly complex systems have been conceived without comparison to simpler neural baseline systems that would justify their complexity. In this work, we propose a simple heuristic that guides the development of neural baseline systems for the extractive QA task. We find that there are two ingredients necessary for building a high-performing neural QA system: first, the awareness of question words while processing the context and second, a composition function that goes beyond simple bag-of-words modeling, such as recurrent neural networks. Our results show that FastQA, a system that meets these two requirements, can achieve very competitive performance compared with existing models. We argue that this surprising finding puts…
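
The first ingredient named in the abstract, awareness of question words while processing the context, can be illustrated in its simplest form as a binary flag on each context token. The following is a minimal sketch under stated assumptions, not the paper's implementation: the function name `word_in_question_feature`, the whitespace tokenization, and the lowercase matching are all hypothetical choices made for illustration.

```python
def word_in_question_feature(context_tokens, question_tokens):
    """Flag each context token that also occurs in the question.

    Illustrative assumption: matching is exact after lowercasing.
    Returns a list of floats (1.0 = token appears in the question).
    """
    question_vocab = {tok.lower() for tok in question_tokens}
    return [1.0 if tok.lower() in question_vocab else 0.0
            for tok in context_tokens]


# Toy example with whitespace tokenization (an assumption for brevity).
question = "Who wrote Hamlet ?".split()
context = "Hamlet was written by William Shakespeare .".split()

features = word_in_question_feature(context, question)

# Pair each context token with its flag for inspection.
for token, flag in zip(context, features):
    print(f"{token}\t{flag}")
```

In a sketch like this, the flag would typically be appended to each context token's embedding before the sequence is passed to the second ingredient, a composition function such as a recurrent neural network, so that the encoder can see which context words the question mentions.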

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications