BERT-Flow-VAE: A Weakly-supervised Model for Multi-Label Text Classification
Ziwen Liu, Josep Grau-Bove, Scott Allan Orr

TL;DR
BERT-Flow-VAE is a weakly-supervised model for multi-label text classification that reduces annotation costs by leveraging BERT embeddings, flow calibration, seed-based topic modeling, and a VAE framework, achieving near-supervised performance.
Contribution
The paper introduces BERT-Flow-VAE, a novel weakly-supervised approach combining embedding calibration, seed-based topic generation, and a VAE to effectively perform multi-label text classification with minimal supervision.
Findings
Outperforms baseline weakly-supervised models on 6 datasets
Achieves approximately 84% of fully-supervised model performance
Effective use of seed words and flow calibration enhances classification accuracy
Abstract
Multi-label Text Classification (MLTC) is the task of categorizing documents into one or more topics. Considering the large volumes of data and varying domains of such tasks, fully supervised learning requires manually fully annotated datasets which is costly and time-consuming. In this paper, we propose BERT-Flow-VAE (BFV), a Weakly-Supervised Multi-Label Text Classification (WSMLTC) model that reduces the need for full supervision. This new model (1) produces BERT sentence embeddings and calibrates them using a flow model, (2) generates an initial topic-document matrix by averaging results of a seeded sparse topic model and a textual entailment model which only require surface name of topics and 4-6 seed words per topic, and (3) adopts a VAE framework to reconstruct the embeddings under the guidance of the topic-document matrix. Finally, (4) it uses the means produced by the encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Layer Normalization · Residual Connection · Dropout · Weight Decay · Adam · Softmax · WordPiece
