Enhanced LSTM for Natural Language Inference

Qian Chen; Xiaodan Zhu; Zhenhua Ling; Si Wei; Hui Jiang; Diana Inkpen

arXiv:1609.06038·cs.CL·March 4, 2020

Enhanced LSTM for Natural Language Inference

Qian Chen, Xiaodan Zhu, Zhenhua Ling, Si Wei, Hui Jiang, Diana Inkpen

PDF

5 Repos

TL;DR

This paper introduces an enhanced LSTM-based model for natural language inference that achieves state-of-the-art accuracy by combining sequential and recursive architectures, with syntactic parsing information further boosting performance.

Contribution

It demonstrates that carefully designed chain LSTM models can outperform complex architectures, and that recursive structures with syntactic parsing improve inference accuracy.

Findings

01

Achieved 88.6% accuracy on SNLI dataset.

02

Sequential LSTM models outperform previous complex models.

03

Recursive architectures with parsing information further improve results.

Abstract

Reasoning and inference are central to human and artificial intelligence. Modeling inference in human language is very challenging. With the availability of large annotated data (Bowman et al., 2015), it has recently become feasible to train neural network based inference models, which have shown to be very effective. In this paper, we present a new state-of-the-art result, achieving the accuracy of 88.6% on the Stanford Natural Language Inference Dataset. Unlike the previous top models that use very complicated network architectures, we first demonstrate that carefully designing sequential inference models based on chain LSTMs can outperform all previous models. Based on this, we further show that by explicitly considering recursive architectures in both local inference modeling and inference composition, we achieve additional improvement. Particularly, incorporating syntactic parsing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEnhanced Sequential Inference Model