A Comprehensive Survey of Text Classification Techniques and Their Research Applications: Observational and Experimental Insights
Kamal Taha, Paul D. Yoo, Chan Yeun, Aya Taha

TL;DR
This survey provides a detailed taxonomy and evaluation of text classification techniques, highlighting their research applications and offering insights into their effectiveness through empirical and experimental analysis.
Contribution
It introduces a hierarchical taxonomy for text classification based on research fields and evaluates techniques using dual empirical and experimental approaches.
Findings
A structured, hierarchical taxonomy makes text classification algorithms easier to understand.
Empirical assessment across four criteria offers comprehensive insights.
Experimental comparison ranks techniques within specific research sub-categories.
Abstract
The exponential growth of textual data presents substantial challenges in management and analysis, notably due to high storage and processing costs. Text classification, a vital aspect of text mining, provides robust solutions by enabling efficient categorization and organization of text data. These techniques allow individuals, researchers, and businesses to derive meaningful patterns and insights from large volumes of text. This survey paper introduces a comprehensive taxonomy specifically designed for text classification based on research fields. The taxonomy is structured into hierarchical levels: research field-based category, research field-based sub-category, methodology-based technique, methodology sub-technique, and research field applications. We employ a dual evaluation approach: empirical and experimental. Empirically, we assess text classification techniques across four…
Taxonomy
Topics: Text and Document Classification Technologies
