REVERE: Reflective Evolving Research Engineer for Scientific Workflows
Balaji Dinesh Gangireddi, Aniketh Garikaparthi, Manasi Patwardhan, Arman Cohan

TL;DR
REVERE is a framework that enables scientific workflow agents to continually learn from experience, recognize recurring issues, and improve their performance by making targeted, knowledge-preserving edits across prompts and resources.
Contribution
It introduces a reflective, continual learning approach for research coding workflows that enhances performance by recognizing patterns and distilling reusable heuristics.
Findings
REVERE improves performance on research coding benchmarks by 3.5-4.9%.
It demonstrates the effectiveness of continual, reflective learning in scientific workflows.
Agents with REVERE evolve capabilities over time through global memory consolidation.
Abstract
Existing prompt-optimization techniques rely on local signals to update behavior, often neglecting broader and recurring patterns across tasks, leading to poor generalization; they further rely on full-prompt rewrites or unstructured merges, resulting in knowledge loss. These limitations are magnified in research-coding workflows, which involve heterogeneous repositories, underspecified environments, and weak feedback, where reproducing results from public codebases is an established evaluation regime. We introduce Reflective Evolving Research Engineer (REVERE), a framework that continuously learns from Global Training Context, recognizes recurring failure modes in cross-repository execution trajectories, distills them into reusable heuristics, and performs targeted edits across three configurable fields: the system prompt, a task-prompt template, and a cumulative cheatsheet. REVERE,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Distributed and Parallel Computing Systems
