A Scientific Human-Agent Reproduction Pipeline
Joschka Birk, Gregor Kasieczka, Siddharth Mishra-Sharma, Benjamin Nachman, Dennis Noll, Tanvi Wamorkar

TL;DR
SHARP is a structured framework that leverages AI agents and human collaboration to automate and improve the reproducibility of scientific data analyses, emphasizing human oversight and interaction.
Contribution
The paper introduces SHARP, a novel AI-assisted pipeline for scientific reproduction that decomposes tasks into steps managed by specialized subagents with human oversight.
Findings
Successfully reproduced a jet classification task in particle physics.
Evaluated analysis performance, code quality, and human-agent interaction.
Demonstrated human control in AI-assisted scientific reproduction.
Abstract
Reproducing scientific analyses is essential for preserving knowledge, building extensible codebases, and deepening researcher understanding - yet the effort often outweighs its academic recognition. We argue that the reproduction of scientific data analyses is fundamentally a translation task: converting human-readable knowledge (papers, documentation) into machine-readable analysis code. This makes it uniquely well-suited for AI agents. We present SHARP (Scientific Human-Agent Reproduction Pipeline), a structured framework for reproducing scientific analyses through human-agent collaboration. SHARP decomposes a reproduction task into discrete steps, which an AI agent executes autonomously using specialized subagents for code generation, testing, and quality assurance. At defined checkpoints, the researcher reviews progress, provides feedback, and steers the analysis - keeping the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
