TL;DR
This paper introduces a new task of predicting the overall sentiment of tweet replies, presents a large annotated dataset called RETWEET, and proposes a two-stage deep learning approach that leverages automatically labeled data for training.
Contribution
The paper creates RETWEET, a large manually annotated dataset for reply sentiment prediction, and develops a novel two-stage deep learning method that does not require manual labels for training.
Findings
The proposed method achieves promising results on RETWEET.
Automatically labeled data can effectively train reply sentiment classifiers.
The dataset and baseline are publicly available for further research.
Abstract
Twitter sentiment analysis, which often focuses on predicting the polarity of tweets, has attracted increasing attention over the last years, in particular with the rise of deep learning (DL). In this paper, we propose a new task: predicting the predominant sentiment among (first-order) replies to a given tweet. Therefore, we created RETWEET, a large dataset of tweets and replies manually annotated with sentiment labels. As a strong baseline, we propose a two-stage DL-based method: first, we create automatically labeled training data by applying a standard sentiment classifier to tweet replies and aggregating its predictions for each original tweet; our rationale is that individual errors made by the classifier are likely to cancel out in the aggregation step. Second, we use the automatically labeled data for supervised training of a neural network to predict reply sentiment from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
