A protocol to gather, characterize and analyze incoming citations of retracted articles
Ivan Heibi, Silvio Peroni

TL;DR
This paper introduces a comprehensive methodology for collecting, characterizing, and analyzing citations of retracted articles to understand citation behaviors and patterns related to retractions.
Contribution
It presents a novel, step-by-step protocol combining data collection, feature extraction, statistical analysis, and topic modeling for studying citations of retracted papers.
Findings
Provides a structured approach for analyzing retracted article citations
Enables visualization of citation behaviors and patterns
Facilitates understanding of retraction impact on scholarly communication
Abstract
In this article, we present a methodology which takes as input a collection of retracted articles, gathers the entities citing them, characterizes such entities according to multiple dimensions (disciplines, year of publication, sentiment, etc.), and applies a quantitative and qualitative analysis on the collected values. The methodology is composed of four phases: (1) identifying, retrieving, and extracting basic metadata of the entities which have cited a retracted article, (2) extracting and labeling additional features based on the textual content of the citing entities, (3) building a descriptive statistical summary based on the collected data, and finally (4) running a topic modeling analysis. The goal of the methodology is to generate data and visualizations that help understanding possible behaviors related to retraction cases. We present the methodology in a structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcademic integrity and plagiarism
