Cannot Predict Comment Volume of a News Article before (a few) Users Read It
Lihong He, Chen Shen, Arjun Mukherjee, Slobodan Vucetic, Eduard Dragut

TL;DR
This paper investigates predicting the total comment count on news articles, revealing that early comment dynamics are the most predictive factor, with minimal influence from article-specific features, across various outlets and categories.
Contribution
It demonstrates that early comment activity is the strongest predictor of total comments, highlighting the importance of social behavior over content features in engagement prediction.
Findings
Early comment rate predicts total comments effectively.
Prediction accuracy varies across news outlets and categories.
Article-specific features have limited impact on comment volume prediction.
Abstract
Many news outlets allow users to contribute comments on topics about daily world events. News articles are the seeds that spring users' interest to contribute content, i.e., comments. An article may attract an apathetic user engagement (several tens of comments) or a spontaneous fervent user engagement (thousands of comments). In this paper, we study the problem of predicting the total number of user comments a news article will receive. Our main insight is that the early dynamics of user comments contribute the most to an accurate prediction, while news article specific factors have surprisingly little influence. This appears to be an interesting and understudied phenomenon: collective social behavior at a news outlet shapes user response and may even downplay the content of an article. We compile and analyze a large number of features, both old and novel from literature. The features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Sentiment Analysis and Opinion Mining · Topic Modeling
