Learning Disentangled Representations for Natural Language Definitions

Danilo S. Carvalho (1); Giangiacomo Mercatali (1); Yingji Zhang (1),; Andre Freitas (1; 2) ((1) Department of Computer Science; University of; Manchester; United Kingdom; (2) Idiap Research Institute; Switzerland)

arXiv:2210.02898·cs.CL·February 17, 2023·1 cites

Learning Disentangled Representations for Natural Language Definitions

Danilo S. Carvalho (1), Giangiacomo Mercatali (1), Yingji Zhang (1),, Andre Freitas (1, 2) ((1) Department of Computer Science, University of, Manchester, United Kingdom, (2) Idiap Research Institute, Switzerland)

PDF

Open Access

TL;DR

This paper introduces a method using definitional sentences and semantic structures to train a Variational Autoencoder that learns disentangled representations, enhancing interpretability and downstream NLP tasks.

Contribution

It proposes a novel approach leveraging semantic regularities in definitional sentences to improve disentanglement in neural models for NLP.

Findings

01

Outperforms unsupervised baselines in disentanglement benchmarks.

02

Improves downstream definition modeling performance.

03

Utilizes semantic structures for better representation learning.

Abstract

Disentangling the encodings of neural models is a fundamental aspect for improving interpretability, semantic control and downstream task performance in Natural Language Processing. Currently, most disentanglement methods are unsupervised or rely on synthetic datasets with known generative factors. We argue that recurrent syntactic and semantic regularities in textual data can be used to provide the models with both structural biases and generative factors. We leverage the semantic structures present in a representative and semantically dense category of sentence types, definitional sentences, for training a Variational Autoencoder to learn disentangled representations. Our experimental results show that the proposed model outperforms unsupervised baselines on several qualitative and quantitative benchmarks for disentanglement, and it also improves the results in the downstream task of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification