Stochastic Answer Networks for Natural Language Inference
Xiaodong Liu, Kevin Duh, Jianfeng Gao

TL;DR
This paper introduces a stochastic answer network that iteratively refines predictions for natural language inference, achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper presents a novel stochastic answer network that employs multi-step inference strategies for improved natural language inference performance.
Findings
SAN achieves state-of-the-art results on SNLI, MultiNLI, and Quora datasets.
Iterative refinement improves inference accuracy.
Model outperforms previous methods on benchmark datasets.
Abstract
We propose a stochastic answer network (SAN) to explore multi-step inference strategies in Natural Language Inference. Rather than directly predicting the results given the inputs, the model maintains a state and iteratively refines its predictions. Our experiments show that SAN achieves the state-of-the-art results on three benchmarks: Stanford Natural Language Inference (SNLI) dataset, MultiGenre Natural Language Inference (MultiNLI) dataset and Quora Question Pairs dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
