An Improved Text Sentiment Classification Model Using TF-IDF and Next   Word Negation

Bijoyan Das; Sarit Chakraborty

arXiv:1806.06407·cs.CL·June 19, 2018·47 cites

An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation

Bijoyan Das, Sarit Chakraborty

PDF

Open Access

TL;DR

This paper introduces an enhanced sentiment classification approach combining TF-IDF with Next Word Negation, demonstrating improved accuracy across multiple algorithms, especially with Linear SVM, for automatic electronic document analysis.

Contribution

The paper proposes integrating Next Word Negation with TF-IDF for sentiment classification, showing significant accuracy improvements over traditional models.

Findings

01

TF-IDF-NWN outperforms binary bag of words and TF-IDF alone.

02

Linear SVM achieves highest accuracy with the proposed model.

03

Significant accuracy increase compared to previous methods.

Abstract

With the rapid growth of Text sentiment analysis, the demand for automatic classification of electronic documents has increased by leaps and bound. The paradigm of text classification or text mining has been the subject of many research works in recent time. In this paper we propose a technique for text sentiment classification using term frequency- inverse document frequency (TF-IDF) along with Next Word Negation (NWN). We have also compared the performances of binary bag of words model, TF-IDF model and TF-IDF with next word negation (TF-IDF-NWN) model for text classification. Our proposed model is then applied on three different text mining algorithms and we found the Linear Support vector machine (LSVM) is the most appropriate to work with our proposed model. The achieved results show significant increase in accuracy compared to earlier methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Spam and Phishing Detection