Stochastic Answer Networks for Natural Language Inference

Xiaodong Liu; Kevin Duh; Jianfeng Gao

arXiv:1804.07888 · cs.CL · April 2, 2019 · 51 cites

TL;DR

This paper introduces a stochastic answer network (SAN) that maintains a state and iteratively refines its predictions for natural language inference, achieving state-of-the-art results on three benchmarks: SNLI, MultiNLI, and Quora Question Pairs.

Contribution

The paper presents a stochastic answer network (SAN) that applies multi-step inference to natural language inference: rather than predicting directly from the inputs, it refines its prediction over several steps.

Findings

01

SAN achieves state-of-the-art results on the SNLI, MultiNLI, and Quora Question Pairs datasets.

02

Iterative refinement improves inference accuracy.

03

Model outperforms previous methods on benchmark datasets.

Abstract

We propose a stochastic answer network (SAN) to explore multi-step inference strategies in Natural Language Inference. Rather than directly predicting the results given the inputs, the model maintains a state and iteratively refines its predictions. Our experiments show that SAN achieves state-of-the-art results on three benchmarks: the Stanford Natural Language Inference (SNLI) dataset, the Multi-Genre Natural Language Inference (MultiNLI) dataset, and the Quora Question Pairs dataset.
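
The abstract only states that the model keeps a state and refines its predictions over several steps, so the following is a minimal sketch of that general idea, not the authors' implementation. Everything in it is an assumption made for illustration: the function name `multi_step_predict`, the tanh state update (a stand-in for whatever recurrent cell the paper uses), the weight shapes, and the reading of "stochastic" as randomly skipping some per-step predictions before averaging.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def multi_step_predict(features, w_state, w_out, num_steps=5,
                       step_dropout=0.0, rng=None):
    """Hypothetical multi-step predictor (illustrative only).

    features:     encoded premise/hypothesis pair, a list of floats
    w_state:      square matrix used to update the state (assumed)
    w_out:        matrix mapping the state to class scores (assumed)
    step_dropout: probability of skipping a step's prediction when
                  averaging (our assumption about the "stochastic" part)
    """
    rng = rng or random.Random(0)
    state = list(features)
    step_probs = []
    for _ in range(num_steps):
        # Refine the state: a plain tanh recurrence that mixes the previous
        # state with the input features.
        mixed = matvec(w_state, state)
        state = [math.tanh(m + f) for m, f in zip(mixed, features)]
        # Each step emits its own class distribution.
        step_probs.append(softmax(matvec(w_out, state)))

    # Randomly skip some step predictions, then average the rest.
    kept = [p for p in step_probs if rng.random() >= step_dropout]
    if not kept:
        kept = [step_probs[-1]]  # fall back to the final step
    return [sum(p[c] for p in kept) / len(kept) for c in range(len(kept[0]))]


# Toy usage with random weights; three classes stand in for the NLI labels.
rng = random.Random(0)
dim, num_classes = 4, 3
w_state = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(dim)]
w_out = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(num_classes)]
features = [0.2, -0.1, 0.4, 0.3]
probs = multi_step_predict(features, w_state, w_out,
                           num_steps=5, step_dropout=0.4, rng=rng)
```

Averaging the per-step distributions means the returned `probs` is still a valid probability distribution. Setting `step_dropout=0.0` averages every step, which is how a sketch like this would typically be run at evaluation time.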

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications