Can social media provide early warning of retraction? Evidence from critical tweets identified by human annotation and large language models

Er-Te Zheng; Hui-Zhen Fu; Mike Thelwall; Zhichao Fang

arXiv:2403.16851 · cs.DL · September 26, 2025 · 3 cites

Open Access

TL;DR

This study tests whether critical tweets about a scientific article can serve as an early warning that the article will later be retracted, and examines how well large language models detect such tweets compared with human annotators.

Contribution

It shows that critical tweets can precede retraction and compares large language models against human annotation for identifying tweets that are critical of an article.

Findings

01 8.3% of retracted articles had at least one critical tweet before retraction.

02 Only 1.5% of comparable non-retracted articles had a critical tweet.

03 Critical tweets identified by LLMs only partially aligned with human annotation, so automated detection should be used with caution.

Abstract

Timely detection of problematic research is essential for safeguarding scientific integrity. To explore whether social media commentary can serve as an early indicator of potentially problematic articles, this study analysed 3,815 tweets referencing 604 retracted articles and 3,373 tweets referencing 668 comparable non-retracted articles. Tweets critical of the articles were identified through both human annotation and large language models (LLMs). Human annotation revealed that 8.3% of retracted articles were associated with at least one critical tweet prior to retraction, compared to only 1.5% of non-retracted articles, highlighting the potential of tweets as early warning signals of retraction. However, critical tweets identified by LLMs (GPT-4o mini, Gemini 2.0 Flash-Lite, and Claude 3.5 Haiku) only partially aligned with human annotation, suggesting that fully automated monitoring…
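The article-level measure described in the abstract (the share of articles with at least one critical tweet before retraction) can be illustrated with a minimal sketch. This is not the authors' code: the function name, the tuple layout, and the toy data are all hypothetical, and the cutoff date for non-retracted articles is left as an input because the abstract does not say how that comparison window is defined.

```python
from datetime import date


def share_with_early_critical_tweet(tweets, cutoff_by_article):
    """Fraction of articles with at least one critical tweet before their cutoff.

    tweets: iterable of (article_id, tweet_date, is_critical) tuples
        (hypothetical layout, assumed for this sketch).
    cutoff_by_article: dict mapping article_id -> cutoff date. For a retracted
        article this would be its retraction date. Every article in the sample
        must appear here, since this dict defines the denominator.
    """
    flagged = set()
    for article_id, tweet_date, is_critical in tweets:
        cutoff = cutoff_by_article.get(article_id)
        # Count an article once, however many early critical tweets it has.
        if cutoff is not None and is_critical and tweet_date < cutoff:
            flagged.add(article_id)
    if not cutoff_by_article:
        return 0.0
    return len(flagged) / len(cutoff_by_article)


# Toy data invented for illustration only; not taken from the study.
toy_cutoffs = {
    "a1": date(2021, 5, 1),
    "a2": date(2021, 6, 1),
    "a3": date(2021, 7, 1),
    "a4": date(2021, 8, 1),
}
toy_tweets = [
    ("a1", date(2021, 4, 1), True),   # critical and before the cutoff: counts
    ("a1", date(2021, 4, 2), True),   # same article, still counted once
    ("a2", date(2021, 6, 15), True),  # critical, but after the cutoff
    ("a3", date(2021, 6, 1), False),  # before the cutoff, but not critical
]

share = share_with_early_critical_tweet(toy_tweets, toy_cutoffs)
print(f"{share:.1%}")  # prints 25.0%
```

Counting at the article level rather than the tweet level is what makes the 8.3% and 1.5% figures comparable: a single heavily discussed article with many critical tweets contributes once, not many times.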

Taxonomy

Topics: Artificial Intelligence in Healthcare and Education · Academic integrity and plagiarism