Can citations tell us about a paper's reproducibility? A case study of   machine learning papers

Rochana R. Obadage; Sarah M. Rajtmajer; Jian Wu

arXiv:2405.03977·cs.DL·May 8, 2024

Can citations tell us about a paper's reproducibility? A case study of machine learning papers

Rochana R. Obadage, Sarah M. Rajtmajer, Jian Wu

PDF

1 Repo

TL;DR

This paper investigates whether analyzing citation contexts can serve as an indicator of a machine learning paper's reproducibility, using sentiment analysis to interpret reproduction outcomes.

Contribution

It introduces a sentiment analysis framework for citation contexts and explores their correlation with reproducibility scores in ML research.

Findings

01

Classifiers for reproducibility-related citation contexts were successfully trained.

02

Sentiment in citation contexts correlates with reproducibility outcomes.

03

The approach offers a potential tool for assessing reproducibility through citation analysis.

Abstract

The iterative character of work in machine learning (ML) and artificial intelligence (AI) and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lamps-lab/ccair-ai-reproducibility
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.