TL;DR
This paper presents a regression model that uses social media engagement metrics from Reddit to accurately predict sports viewership, aiding in revenue forecasting and advertising strategies.
Contribution
It introduces a novel approach combining social media data and regression techniques to predict sports viewership with high accuracy, including specific improvements for different sports.
Findings
Achieved an R^2 of 0.99 in viewership prediction.
Model's MAE of 1.27 million viewers demonstrates high precision.
Effective use of social media metrics for audience engagement analysis.
Abstract
Accurately predicting sports viewership is crucial for optimizing ad sales and revenue forecasting. Social media platforms, such as Reddit, provide a wealth of user-generated content that reflects audience engagement and interest. In this study, we propose a regression-based approach to predict sports viewership using social media metrics, including post counts, comments, scores, and sentiment analysis from TextBlob and VADER. Through iterative improvements, such as focusing on major sports subreddits, incorporating categorical features, and handling outliers by sport, the model achieved an of 0.99, a Mean Absolute Error (MAE) of 1.27 million viewers, and a Root Mean Squared Error (RMSE) of 2.33 million viewers on the full dataset. These results demonstrate the model's ability to accurately capture patterns in audience behavior, offering significant potential for pre-event revenue…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
