Towards Unsupervised Content Disentanglement in Sentence Representations via Syntactic Roles
Ghazi Felhi, Joseph Le Roux, Djam\'e Seddah

TL;DR
This paper introduces ADVAE, a probabilistic model that learns to disentangle syntactic roles in sentence representations without supervision, enabling better interpretability and controllable content generation in NLP.
Contribution
The paper proposes ADVAE, an attention-based variational autoencoder that achieves unsupervised disentanglement of syntactic roles in sentence representations, outperforming classical models.
Findings
Disentanglement of syntactic roles achieved without supervision.
ADVAE outperforms classical sequence and Transformer VAEs in role separation.
Syntactic roles can be manipulated by intervening on latent variables.
Abstract
Linking neural representations to linguistic factors is crucial in order to build and analyze NLP models interpretable by humans. Among these factors, syntactic roles (e.g. subjects, direct objects,) and their realizations are essential markers since they can be understood as a decomposition of predicative structures and thus the meaning of sentences. Starting from a deep probabilistic generative model with attention, we measure the interaction between latent variables and realizations of syntactic roles and show that it is possible to obtain, without supervision, representations of sentences where different syntactic roles correspond to clearly identified different latent variables. The probabilistic model we propose is an Attention-Driven Variational Autoencoder (ADVAE). Drawing inspiration from Transformer-based machine translation models, ADVAEs enable the analysis of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Hate Speech and Cyberbullying Detection
MethodsAttention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Dropout · Multi-Head Attention · Byte Pair Encoding · Label Smoothing · Residual Connection
