Towards Unsupervised Content Disentanglement in Sentence Representations   via Syntactic Roles

Ghazi Felhi; Joseph Le Roux; Djam\'e Seddah

arXiv:2206.11184·cs.CL·June 23, 2022·1 cites

Towards Unsupervised Content Disentanglement in Sentence Representations via Syntactic Roles

Ghazi Felhi, Joseph Le Roux, Djam\'e Seddah

PDF

Open Access 1 Repo

TL;DR

This paper introduces ADVAE, a probabilistic model that learns to disentangle syntactic roles in sentence representations without supervision, enabling better interpretability and controllable content generation in NLP.

Contribution

The paper proposes ADVAE, an attention-based variational autoencoder that achieves unsupervised disentanglement of syntactic roles in sentence representations, outperforming classical models.

Findings

01

Disentanglement of syntactic roles achieved without supervision.

02

ADVAE outperforms classical sequence and Transformer VAEs in role separation.

03

Syntactic roles can be manipulated by intervening on latent variables.

Abstract

Linking neural representations to linguistic factors is crucial in order to build and analyze NLP models interpretable by humans. Among these factors, syntactic roles (e.g. subjects, direct objects, $\dots$ ) and their realizations are essential markers since they can be understood as a decomposition of predicative structures and thus the meaning of sentences. Starting from a deep probabilistic generative model with attention, we measure the interaction between latent variables and realizations of syntactic roles and show that it is possible to obtain, without supervision, representations of sentences where different syntactic roles correspond to clearly identified different latent variables. The probabilistic model we propose is an Attention-Driven Variational Autoencoder (ADVAE). Drawing inspiration from Transformer-based machine translation models, ADVAEs enable the analysis of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ghazi-f/advae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Hate Speech and Cyberbullying Detection

MethodsAttention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Dropout · Multi-Head Attention · Byte Pair Encoding · Label Smoothing · Residual Connection