Modelling Online Comment Threads from their Start
Rachel Krohn, Tim Weninger

TL;DR
This paper introduces the Comment Thread Prediction Model (CTPM), which accurately forecasts the size and structure of online comment threads using only the initial post text, enabling predictions before comments appear.
Contribution
The paper presents a novel model that predicts comment thread size and shape solely from initial post text, outperforming existing models especially for new posts.
Findings
CTPM significantly outperforms existing models on Reddit data.
The model effectively predicts thread growth without waiting for comments.
Results demonstrate high accuracy in diverse subreddit discussions.
Abstract
The social Web is a widely used platform for online discussion. Across social media, users can start discussions by posting a topical image, url, or message. Upon seeing this initial post, other users may add their own comments to the post, or to another user's comment. The resulting online discourse produces a comment thread, which constitutes an enormous portion of modern online communication. Comment threads are often viewed as trees: nodes represent the post and its comments, while directed edges represent reply-to relationships. The goal of the present work is to predict the size and shape of these comment threads. Existing models do this by observing the first several comments and then fitting a predictive model. However, most comment threads are relatively small, and waiting for data to materialize runs counter to the goal of the prediction task. We therefore introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
