Using Text Similarity to Detect Social Interactions not Captured by Formal Reply Mechanisms
Samuel Barbosa, Roberto M. Cesar-Jr, Dan Cosley

TL;DR
This study introduces a tf-idf similarity method to detect implicit social responses on Twitter, revealing that over 11% of reactions are missed by explicit reply mechanisms, which impacts social interaction modeling.
Contribution
The paper presents a novel approach using text similarity to identify non-explicit responses, highlighting the extent of missed reactions in social media analysis.
Findings
At least 11% of reactions are not captured by explicit reply mechanisms.
Some users are more responsive through implicit responses than official interfaces.
Detecting non-explicit responses improves understanding of social interactions.
Abstract
In modeling social interaction online, it is important to understand when people are reacting to each other. Many systems have explicit indicators of replies, such as threading in discussion forums or replies and retweets in Twitter. However, it is likely these explicit indicators capture only part of people's reactions to each other, thus, computational social science approaches that use them to infer relationships or influence are likely to miss the mark. This paper explores the problem of detecting non-explicit responses, presenting a new approach that uses tf-idf similarity between a user's own tweets and recent tweets by people they follow. Based on a month's worth of posting data from 449 ego networks in Twitter, this method demonstrates that it is likely that at least 11% of reactions are not captured by the explicit reply and retweet mechanisms. Further, these uncaptured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
