Unsupervised Learning of Explainable Parse Trees for Improved   Generalisation

Atul Sahay; Ayush Maheshwari; Ritesh Kumar; Ganesh Ramakrishnan,; Manjesh Kumar Hanawal; Kavi Arya

arXiv:2104.04998·cs.CL·April 13, 2021

Unsupervised Learning of Explainable Parse Trees for Improved Generalisation

Atul Sahay, Ayush Maheshwari, Ritesh Kumar, Ganesh Ramakrishnan,, Manjesh Kumar Hanawal, Kavi Arya

PDF

1 Repo

TL;DR

This paper introduces an attention-based Tree-LSTM model that learns more interpretable and meaningful parse trees, leading to improved performance across various NLP tasks and better linguistic structure discovery.

Contribution

It proposes a novel attention mechanism over Tree-LSTMs for learning explainable parse trees, enhancing interpretability and semantic correctness in NLP representations.

Findings

01

Improved performance on natural language inference, semantic relatedness, and sentiment analysis.

02

Learned parse trees are more explainable and linguistically meaningful.

03

Model outperforms recent RvNN-based methods.

Abstract

Recursive neural networks (RvNN) have been shown useful for learning sentence representations and helped achieve competitive performance on several natural language inference tasks. However, recent RvNN-based models fail to learn simple grammar and meaningful semantics in their intermediate tree representation. In this work, we propose an attention mechanism over Tree-LSTMs to learn more meaningful and explainable parse tree structures. We also demonstrate the superior performance of our proposed model on natural language inference, semantic relatedness, and sentiment analysis tasks and compare them with other state-of-the-art RvNN based methods. Further, we present a detailed qualitative and quantitative analysis of the learned parse trees and show that the discovered linguistic structures are more explainable, semantically meaningful, and grammatically correct than recent approaches.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

atul04/Explainable-Latent-Structures-Using-Attention
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.