Learning to Compose Task-Specific Tree Structures

Jihun Choi; Kang Min Yoo; Sang-goo Lee

arXiv:1707.02786·cs.CL·November 22, 2017

Learning to Compose Task-Specific Tree Structures

Jihun Choi, Kang Min Yoo, Sang-goo Lee

PDF

1 Repo

TL;DR

This paper introduces Gumbel Tree-LSTM, a novel neural architecture that learns task-specific tree structures directly from plain text, improving performance and convergence speed in NLP tasks.

Contribution

The paper proposes a new Gumbel Tree-LSTM model that learns tree structures without structured input, using Gumbel-Softmax for dynamic parent node selection.

Findings

01

Outperforms or matches previous models in NLP tasks

02

Converges faster than existing models

03

Effectively learns task-specific tree structures from plain text

Abstract

For years, recursive neural networks (RvNNs) have been shown to be suitable for representing text into fixed-length vectors and achieved good performance on several natural language processing tasks. However, the main drawback of RvNNs is that they require structured input, which makes data preparation and model implementation hard. In this paper, we propose Gumbel Tree-LSTM, a novel tree-structured long short-term memory architecture that learns how to compose task-specific tree structures only from plain text data efficiently. Our model uses Straight-Through Gumbel-Softmax estimator to decide the parent node among candidates dynamically and to calculate gradients of the discrete decision. We evaluate the proposed model on natural language inference and sentiment analysis, and show that our model outperforms or is at least comparable to previous models. We also find that our model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jihunchoi/unsupervised-treelstm
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.