Learning to Embed Sentences Using Attentive Recursive Trees

Jiaxin Shi; Lei Hou; Juanzi Li; Zhiyuan Liu; Hanwang Zhang

arXiv:1811.02338 · cs.CL · November 16, 2018 · 1 citation

TL;DR

This paper introduces AR-Tree, an attentive recursive tree model that dynamically emphasizes important words in sentence embeddings, improving performance on sentence understanding tasks.

Contribution

The paper proposes a novel attentive recursive tree model with reinforced training that highlights task-informative words in sentence embeddings.

Findings

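01. AR-Tree outperforms, or is at least comparable to, state-of-the-art methods on three sentence understanding tasks.

02. Placing words in the tree according to their task-specific importance improves embedding quality.

03. End-to-end reinforced training improves model performance.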

Abstract

Sentence embedding is an effective feature representation for most deep learning-based NLP tasks. One prevailing line of methods is using recursive latent tree-structured networks to embed sentences with task-specific structures. However, existing models have no explicit mechanism to emphasize task-informative words in the tree structure. To this end, we propose an Attentive Recursive Tree model (AR-Tree), where the words are dynamically located according to their importance in the task. Specifically, we construct the latent tree for a sentence in a proposed important-first strategy, and place more attentive words nearer to the root; thus, AR-Tree can inherently emphasize important words during the bottom-up composition of the sentence embedding. We propose an end-to-end reinforced training strategy for AR-Tree, which is demonstrated to consistently outperform, or be at least comparable…
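To make the important-first idea concrete, here is a minimal sketch of one way such a tree could be built. It assumes each word already has a scalar importance score (made up here, where the paper would learn them) and that the most important word in a span becomes the root, with the words to its left and right forming the two subtrees. The names `Node`, `build_tree`, and `depths` are hypothetical and not taken from the paper or its repositories; the abstract does not spell out the exact construction, so treat this as an illustration rather than the authors' implementation.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Node:
    word: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_tree(words: List[str], scores: List[float]) -> Optional[Node]:
    """Recursively make the highest-scoring word the root of each span."""
    if not words:
        return None
    # Index of the most important word; ties go to the leftmost one.
    i = max(range(len(words)), key=lambda k: scores[k])
    return Node(
        word=words[i],
        left=build_tree(words[:i], scores[:i]),
        right=build_tree(words[i + 1:], scores[i + 1:]),
    )


def depths(node: Optional[Node], depth: int = 0,
           out: Optional[Dict[str, int]] = None) -> Dict[str, int]:
    """Map each word to its depth (the root is 0) via an in-order walk."""
    if out is None:
        out = {}
    if node is not None:
        depths(node.left, depth + 1, out)
        out[node.word] = depth
        depths(node.right, depth + 1, out)
    return out


words = ["the", "movie", "was", "surprisingly", "good"]
scores = [0.1, 0.6, 0.2, 0.7, 0.9]  # made-up importance scores (assumption)
tree = build_tree(words, scores)
word_depths = depths(tree)
```

With these scores, "good" becomes the root and "surprisingly" sits directly beneath it, while low-scoring words such as "the" end up at the leaves. A bottom-up composition over such a tree merges the root's word last, which is one way to read the abstract's claim that important words are inherently emphasized.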

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining