Stochastic Answer Networks for Machine Reading Comprehension

Xiaodong Liu; Yelong Shen; Kevin Duh; Jianfeng Gao

arXiv:1712.03556·cs.CL·May 16, 2018·34 cites

Stochastic Answer Networks for Machine Reading Comprehension

Xiaodong Liu, Yelong Shen, Kevin Duh, Jianfeng Gao

PDF

Open Access 5 Repos

TL;DR

This paper introduces a stochastic answer network (SAN) that enhances machine reading comprehension by employing stochastic dropout during training, leading to improved robustness and competitive results on multiple datasets.

Contribution

The paper presents a novel stochastic dropout technique in answer modules for multi-step reasoning, improving robustness over previous reinforcement learning approaches.

Findings

01

Achieves state-of-the-art or competitive results on SQuAD, Adversarial SQuAD, and MS MARCO datasets.

02

Demonstrates that stochastic dropout improves model robustness.

03

Simplifies multi-step reasoning without reinforcement learning.

Abstract

We propose a simple yet robust stochastic answer network (SAN) that simulates multi-step reasoning in machine reading comprehension. Compared to previous work such as ReasoNet which used reinforcement learning to determine the number of steps, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves results competitive to the state-of-the-art on the Stanford Question Answering Dataset (SQuAD), the Adversarial SQuAD, and the Microsoft MAchine Reading COmprehension Dataset (MS MARCO).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques