Character-level Intra Attention Network for Natural Language Inference

Han Yang; Marta R. Costa-juss\`a; Jos\'e A. R. Fonollosa

arXiv:1707.07469·cs.CL·July 25, 2017

Character-level Intra Attention Network for Natural Language Inference

Han Yang, Marta R. Costa-juss\`a, Jos\'e A. R. Fonollosa

PDF

1 Repo

TL;DR

This paper introduces CIAN, a character-level neural network with intra-attention for natural language inference, achieving improved performance on the MNLI dataset by capturing intra-sentence semantics more effectively.

Contribution

The paper presents a novel character-level convolutional network combined with intra-attention for NLI, replacing traditional word embeddings and enhancing sentence understanding.

Findings

01

Improved accuracy on MNLI dataset

02

Effective intra-sentence semantic capture

03

Outperforms previous models on NLI task

Abstract

Natural language inference (NLI) is a central problem in language understanding. End-to-end artificial neural networks have reached state-of-the-art performance in NLI field recently. In this paper, we propose Character-level Intra Attention Network (CIAN) for the NLI task. In our model, we use the character-level convolutional network to replace the standard word embedding layer, and we use the intra attention to capture the intra-sentence semantics. The proposed CIAN model provides improved results based on a newly published MNLI corpus.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanghanxy/CIAN
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.