Challenging the Semi-Supervised VAE Framework for Text Classification

Ghazi Felhi; Joseph Le Roux; Djam\'e Seddah

arXiv:2109.12969·cs.CL·September 28, 2021

Challenging the Semi-Supervised VAE Framework for Text Classification

Ghazi Felhi, Joseph Le Roux, Djam\'e Seddah

PDF

Open Access 1 Repo

TL;DR

This paper simplifies the semi-supervised variational autoencoder framework for text classification by removing certain components, resulting in faster training without sacrificing accuracy, and challenges the necessity of the standard complex design.

Contribution

The paper introduces simplified SSVAE models that remove the KL divergence term and unobserved latent variables, improving efficiency and practicality for text classification.

Findings

01

26% faster training speed

02

Maintains equivalent classification accuracy

03

Simplifies model design and reduces complexity

Abstract

Semi-Supervised Variational Autoencoders (SSVAEs) are widely used models for data efficient learning. In this paper, we question the adequacy of the standard design of sequence SSVAEs for the task of text classification as we exhibit two sources of overcomplexity for which we provide simplifications. These simplifications to SSVAEs preserve their theoretical soundness while providing a number of practical advantages in the semi-supervised setup where the result of training is a text classifier. These simplifications are the removal of (i) the Kullback-Liebler divergence from its objective and (ii) the fully unobserved latent variable from its probabilistic model. These changes relieve users from choosing a prior for their latent variables, make the model smaller and faster, and allow for a better flow of information into the latent variables. We compare the simplified versions to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ghazi-f/challenging-ssvaes
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis