Document classification methods
Madjid Khalilian, Shiva Hassanzadeh

TL;DR
This paper reviews various supervised and unsupervised document classification methods, comparing their advantages and disadvantages to improve text organization and retrieval efficiency.
Contribution
It provides a comparative analysis of multiple document classification techniques, highlighting their strengths and weaknesses.
Findings
Different classification methods have distinct advantages and limitations.
Supervised methods generally achieve higher accuracy than unsupervised ones.
The study offers insights into selecting appropriate methods for specific applications.
Abstract
Information on different fields which are collected by users requires appropriate management and organization to be structured in a standard way and retrieved fast and more easily. Document classification is a conventional method to separate text based on their subjects among scientific text, web pages and digital library. Different methods and techniques are proposed for document classifications that have advantages and deficiencies. In this paper, several unsupervised and supervised document classification methods are studied and compared.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Text Analysis Techniques · Spam and Phishing Detection
