Learning Natural Language Inference using Bidirectional LSTM model and   Inner-Attention

Yang Liu; Chengjie Sun; Lei Lin; Xiaolong Wang

arXiv:1605.09090·cs.CL·May 31, 2016·234 cites

Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention

Yang Liu, Chengjie Sun, Lei Lin, Xiaolong Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces an improved sentence encoding model for natural language inference that employs a novel inner-attention mechanism, outperforming previous methods on the SNLI dataset with fewer parameters.

Contribution

The paper proposes a new inner-attention mechanism within a bidirectional LSTM framework for sentence encoding in natural language inference tasks.

Findings

01

Inner-attention improves sentence representations.

02

Model outperforms existing approaches on SNLI.

03

Fewer parameters needed for better accuracy.

Abstract

In this paper, we proposed a sentence encoding-based model for recognizing text entailment. In our approach, the encoding of sentence is a two-stage process. Firstly, average pooling was used over word-level bidirectional LSTM (biLSTM) to generate a first-stage sentence representation. Secondly, attention mechanism was employed to replace average pooling on the same sentence for better representations. Instead of using target sentence to attend words in source sentence, we utilized the sentence's first-stage representation to attend words appeared in itself, which is called "Inner-Attention" in our paper . Experiments conducted on Stanford Natural Language Inference (SNLI) Corpus has proved the effectiveness of "Inner-Attention" mechanism. With less number of parameters, our model outperformed the existing best sentence encoding-based approach by a large margin.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsAverage Pooling