Social Network Datasets on Reddit Financial Discussion
Zezhong Wang, Siyang Hao, Inez Maria Zwetsloot, Simon Trimborn

TL;DR
This paper introduces a new Reddit dataset focused on discussions about meme stocks like GME, AMC, and BlackBerry, highlighting their influence on stock market movements and providing a resource for further analysis.
Contribution
The paper presents a novel dataset of Reddit posts and comments related to meme stocks, linking social media discussions to stock market fluctuations.
Findings
Posts about meme stocks are correlated with market movements.
The dataset enables analysis of social media influence on finance.
Data collection and processing methods are documented.
Abstract
Stock markets are impacted by a large variety of factors including news and discussions among investors about investment opportunities. With the emergence of social media, new opportunities for having financial discussions arose. The market frenzy surrounding GameStop (GME) on the Reddit subreddit Wallstreetbets, caused financial discussion forums to receive widespread attention and it was established that Wallstreetbets played a leading role in the stock market movements of GME. Here, we present a new data set for exploring the effect of social media discussion forums on the stock market. The dataset consists of posts published on various Reddit subreddits concerning the popular meme stocks GameStop (GME), American Multi-Cinema Entertainment Holdings (AMC), and BlackBerry (BB). We document the data collection and processing steps and show that the posts and comments about these meme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Sentiment Analysis and Opinion Mining · Complex Network Analysis Techniques
