Performance Investigation of Feature Selection Methods
Anuj sharma, Shubhamoy Dey

TL;DR
This study evaluates various feature selection methods and sentiment lexicons to improve sentiment analysis classification performance on movie reviews, highlighting the effectiveness of Gain Ratio and Information Gain.
Contribution
It systematically compares five feature selection techniques and three sentiment lexicons, providing insights into their relative effectiveness for sentiment classification.
Findings
Information Gain showed consistent results.
Gain Ratio performed best overall.
Sentiment lexicons had poor performance.
Abstract
Sentiment analysis or opinion mining has become an open research domain after proliferation of Internet and Web 2.0 social media. People express their attitudes and opinions on social media including blogs, discussion forums, tweets, etc. and, sentiment analysis concerns about detecting and extracting sentiment or opinion from online text. Sentiment based text classification is different from topical text classification since it involves discrimination based on expressed opinion on a topic. Feature selection is significant for sentiment analysis as the opinionated text may have high dimensions, which can adversely affect the performance of sentiment analysis classifier. This paper explores applicability of feature selection methods for sentiment analysis and investigates their performance for classification in term of recall, precision and accuracy. Five feature selection methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Spam and Phishing Detection
