FLiText: A Faster and Lighter Semi-Supervised Text Classification with Convolution Networks

Chen Liu; Mengchao Zhang; Zhibin Fu; Pan Hou; Yu Li

arXiv:2110.11869·cs.CL·October 25, 2021

TL;DR

FLiText is a novel semi-supervised learning framework that significantly improves lightweight text classification models' accuracy with minimal labeled data, outperforming existing methods on multiple benchmarks.

Contribution

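Introduces FLiText, a semi-supervised framework that combines an inspirer network with consistency regularization and is tailored to lightweight NLP models such as TextCNN, achieving state-of-the-art results for lightweight models on multiple SSL text classification benchmarks.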

Findings

01. FLiText improves TextCNN accuracy from 51.00% to 90.49% on IMDb.

02. FLiText improves Yelp-5 accuracy from 39.8% to 58.06%.

03. FLiText achieves an accuracy gain of over 6% using less than 1% labeled data.

Abstract

In natural language processing (NLP), state-of-the-art (SOTA) semi-supervised learning (SSL) frameworks have shown great performance on deep pre-trained language models such as BERT, and are expected to significantly reduce the demand for manual labeling. However, our empirical studies indicate that these frameworks are not suitable for lightweight models such as TextCNN and LSTM. In this work, we develop a new SSL framework called FLiText, which stands for Faster and Lighter semi-supervised Text classification. FLiText introduces an inspirer network together with the consistency regularization framework, which leverages a generalized regular constraint on the lightweight models for efficient SSL. As a result, FLiText obtains new SOTA performance for lightweight models across multiple SSL benchmarks on text classification. Compared with existing SOTA SSL methods on TextCNN, FLiText…
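
The abstract names consistency regularization but, as excerpted here, does not give the loss. As a rough illustration only, the sketch below shows the generic form of a consistency-regularized SSL objective: a supervised cross-entropy term on a labeled example plus a KL term that penalizes disagreement between predictions on an unlabeled input and on a perturbed copy of it. This is not FLiText's actual objective and it omits the inspirer network entirely; the function names, the `weight` parameter, and the choice of KL divergence are all assumptions made for the example.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-probability assigned to the true class."""
    return -math.log(probs[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same classes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def ssl_loss(labeled_logits, label, unlabeled_logits, perturbed_logits, weight=1.0):
    """Hypothetical combined objective: supervised term + weighted consistency term."""
    supervised = cross_entropy(softmax(labeled_logits), label)
    # Prediction on the clean unlabeled input acts as the (fixed) target.
    target = softmax(unlabeled_logits)
    # Prediction on the perturbed/augmented copy is pulled toward that target.
    prediction = softmax(perturbed_logits)
    consistency = kl_divergence(target, prediction)
    return supervised + weight * consistency


if __name__ == "__main__":
    # Agreeing predictions add no consistency penalty; disagreeing ones do.
    print(ssl_loss([2.0, 0.0], 0, [1.0, -1.0], [1.0, -1.0]))
    print(ssl_loss([2.0, 0.0], 0, [1.0, -1.0], [-1.0, 1.0]))
```

When the two unlabeled predictions agree, the consistency term is zero and the loss reduces to the supervised cross-entropy; the more the perturbed prediction drifts, the larger the penalty.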

Code & Models

Repositories

valuesimplex/flitext (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Residual Connection · WordPiece · Dense Connections · Tanh Activation · Linear Warmup With Linear Decay