A Novel Approach to Document Classification using WordNet
Koushiki Sarkar, Ritwika Law

TL;DR
This paper proposes a new document classification method leveraging WordNet to improve upon traditional bag-of-words approaches, aiming for better accuracy by utilizing semantic relationships.
Contribution
It introduces a WordNet-based classification technique that incorporates semantic correlations, offering an alternative to standard clustering methods in text mining.
Findings
Enhanced classification accuracy demonstrated in experiments
Semantic relationships improve feature representation
Outperforms traditional bag-of-words classifiers
Abstract
Content based Document Classification is one of the biggest challenges in the context of free text mining. Current algorithms on document classifications mostly rely on cluster analysis based on bag-of-words approach. However that method is still being applied to many modern scientific dilemmas. It has established a strong presence in fields like economics and social science to merit serious attention from the researchers. In this paper we would like to propose and explore an alternative grounded more securely on the dictionary classification and correlatedness of words and phrases. It is expected that application of our existing knowledge about the underlying classification structure may lead to improvement of the classifier's performance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
