Machine Learning Sentiment Prediction based on Hybrid Document Representation
Panagiotis Stalidis, Maria Giatsoglou, Konstantinos Diamantaras,, George Sarigiannidis, Konstantinos Ch. Chatzisavvas

TL;DR
This paper introduces a hybrid text representation method combining Word2Vec, Bag-of-Words, and sentiment lexicons, improving sentiment classification accuracy on standard datasets compared to individual methods.
Contribution
The paper proposes a novel hybrid vectorization approach for sentiment analysis that enhances classification performance by combining multiple text representation techniques.
Findings
Hybrid approach outperforms individual representations in accuracy
Achieves comparable results to state-of-the-art methods
Effective for sentiment detection on standard datasets
Abstract
Automated sentiment analysis and opinion mining is a complex process concerning the extraction of useful subjective information from text. The explosion of user generated content on the Web, especially the fact that millions of users, on a daily basis, express their opinions on products and services to blogs, wikis, social networks, message boards, etc., render the reliable, automated export of sentiments and opinions from unstructured text crucial for several commercial applications. In this paper, we present a novel hybrid vectorization approach for textual resources that combines a weighted variant of the popular Word2Vec representation (based on Term Frequency-Inverse Document Frequency) representation and with a Bag- of-Words representation and a vector of lexicon-based sentiment values. The proposed text representation approach is assessed through the application of several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Web Data Mining and Analysis · Text and Document Classification Technologies
