Reflective Paper-to-Code Reproduction Enabled by Fine-Grained Verification

Mingyang Zhou; Quanming Yao; Lun Du; Lanning Wei; Da Zheng

arXiv:2508.16671·cs.SE·August 26, 2025

Reflective Paper-to-Code Reproduction Enabled by Fine-Grained Verification

Mingyang Zhou, Quanming Yao, Lun Du, Lanning Wei, Da Zheng

PDF

TL;DR

RePro is a framework that improves paper-to-code reproduction by automatically extracting detailed criteria from research papers, generating code, and iteratively verifying and refining it to match the original implementation.

Contribution

The paper introduces RePro, a novel reflective framework that systematically extracts paper fingerprints and uses iterative verification to enhance reproduction accuracy.

Findings

01

RePro reduces the performance gap by 13.0% on PaperBench Code-Dev.

02

It effectively revises complex logical and mathematical criteria.

03

The approach outperforms existing baselines in paper-to-code reproduction.

Abstract

Reproducing machine learning papers is essential for scientific progress but remains challenging for both humans and automated agents. Existing agent-based methods often struggle to fully and accurately reproduce implementation details such as mathematical formulas and algorithmic logic. Previous studies show that reflection with explicit feedback improves agent performance. However, current paper reproduction methods fail to effectively adopt this strategy. This gap mainly arises from the diverse paper patterns, complex method modules, and varied configurations encountered in research papers. Motivated by how humans use systematic checklists to efficiently debug complex code, we propose \textbf{RePro}, a \textbf{Re}flective Paper-to-Code \textbf{Repro}duction framework that automatically extracts a paper's fingerprint, referring to a comprehensive set of accurate and atomic criteria…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.