Imitation Game: Reproducing Deep Learning Bugs Leveraging an Intelligent Agent

Mehil B Shah; Mohammad Masudur Rahman; Foutse Khomh

arXiv:2512.14990·cs.SE·February 27, 2026

Imitation Game: Reproducing Deep Learning Bugs Leveraging an Intelligent Agent

Mehil B Shah, Mohammad Masudur Rahman, Foutse Khomh

PDF

Open Access

TL;DR

This paper introduces RepGen, an automated intelligent system that significantly improves the reproduction of deep learning bugs, addressing the challenge of nondeterminism and environmental coupling in DL applications.

Contribution

RepGen is a novel approach that leverages learning-enhanced context and an iterative generate-validate-refine process with an LLM to reproduce DL bugs more effectively.

Findings

01

Achieves 80.19% bug reproduction rate, outperforming previous methods.

02

Reduces bug reproduction time by 56.8%.

03

Decreases cognitive load for developers.

Abstract

Despite their wide adoption in various domains (e.g., healthcare, finance, software engineering), Deep Learning (DL)-based applications suffer from many bugs, failures, and vulnerabilities. Reproducing these bugs is essential for their resolution, but it is extremely challenging due to the inherent nondeterminism of DL models and their tight coupling with hardware and software environments. According to recent studies, only about 3% of DL bugs can be reliably reproduced using manual approaches. To address these challenges, we present RepGen, a novel, automated, and intelligent approach for reproducing deep learning bugs. RepGen constructs a learning-enhanced context from a project, develops a comprehensive plan for bug reproduction, employs an iterative generate-validate-refine mechanism, and thus generates such code using an LLM that reproduces the bug at hand. We evaluate RepGen on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software Engineering Research · Advanced Malware Detection Techniques