Disentangling semantics in language through VAEs and a certain   architectural choice

Ghazi Felhi; Joseph Le Roux; Djam\'e Seddah

arXiv:2012.13031·cs.CL·December 29, 2020·1 cites

Disentangling semantics in language through VAEs and a certain architectural choice

Ghazi Felhi, Joseph Le Roux, Djam\'e Seddah

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised approach using modified Transformers and VAEs to learn disentangled sentence representations, enabling manipulation of semantic elements like verbs and objects.

Contribution

It proposes a novel architecture that disentangles semantic components in sentences without supervision, facilitating targeted semantic modifications and swaps.

Findings

01

Successfully separates sentence elements into distinct latent variables

02

Varying latent variables alters specific sentence components

03

Swapping latent variables produces meaningful semantic changes

Abstract

We present an unsupervised method to obtain disentangled representations of sentences that single out semantic content. Using modified Transformers as building blocks, we train a Variational Autoencoder to translate the sentence to a fixed number of hierarchically structured latent variables. We study the influence of each latent variable in generation on the dependency structure of sentences, and on the predicate structure it yields when passed through an Open Information Extraction model. Our model could separate verbs, subjects, direct objects, and prepositional objects into latent variables we identified. We show that varying the corresponding latent variables results in varying these elements in sentences, and that swapping them between couples of sentences leads to the expected partial semantic swap.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ghazi-f/Disentanglement_Transformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Authorship Attribution and Profiling

MethodsSolana Customer Service Number +1-833-534-1729