FLiText: A Faster and Lighter Semi-Supervised Text Classification with Convolution Networks
Chen Liu, Mengchao Zhang, Zhibin Fu, Pan Hou, Yu Li

TL;DR
FLiText is a novel semi-supervised learning framework that significantly improves lightweight text classification models' accuracy with minimal labeled data, outperforming existing methods on multiple benchmarks.
Contribution
Introduces FLiText, a semi-supervised framework with an inspirer network and consistency regularization, tailored for lightweight NLP models, achieving state-of-the-art results.
Findings
FLiText improves TextCNN accuracy from 51.00% to 90.49% on IMDb.
FLiText enhances Yelp-5 accuracy from 39.8% to 58.06%.
FLiText achieves over 6% accuracy gain using less than 1% labeled data.
Abstract
In natural language processing (NLP), state-of-the-art (SOTA) semi-supervised learning (SSL) frameworks have shown great performance on deep pre-trained language models such as BERT, and are expected to significantly reduce the demand for manual labeling. However, our empirical studies indicate that these frameworks are not suitable for lightweight models such as TextCNN, LSTM and etc. In this work, we develop a new SSL framework called FLiText, which stands for Faster and Lighter semi-supervised Text classification. FLiText introduces an inspirer network together with the consistency regularization framework, which leverages a generalized regular constraint on the lightweight models for efficient SSL. As a result, FLiText obtains new SOTA performance for lightweight models across multiple SSL benchmarks on text classification. Compared with existing SOTA SSL methods on TextCNN, FLiText…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Residual Connection · WordPiece · Dense Connections · Tanh Activation · Linear Warmup With Linear Decay
