A Survey on Text Classification: From Shallow to Deep Learning
Qian Li, Hao Peng, Jianxin Li, Congying Xia, Renyu Yang, Lichao Sun,, Philip S. Yu, Lifang He

TL;DR
This survey comprehensively reviews text classification methods from traditional approaches to deep learning, covering models, datasets, evaluation metrics, and future research directions in natural language processing.
Contribution
It provides an updated taxonomy and comparative analysis of text classification techniques from 1961 to 2021, highlighting technical developments and benchmark datasets.
Findings
Deep learning has significantly advanced text classification.
Traditional models are still relevant for certain tasks.
Evaluation metrics have varied in effectiveness.
Abstract
Text classification is the most fundamental and essential task in natural language processing. The last decade has seen a surge of research in this area due to the unprecedented success of deep learning. Numerous methods, datasets, and evaluation metrics have been proposed in the literature, raising the need for a comprehensive and updated survey. This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021, focusing on models from traditional models to deep learning. We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification. We then discuss each of these categories in detail, dealing with both the technical developments and benchmark datasets that support tests of predictions. A comprehensive comparison between different techniques, as well as identifying the pros and cons of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
