REVERE: Reflective Evolving Research Engineer for Scientific Workflows

Balaji Dinesh Gangireddi; Aniketh Garikaparthi; Manasi Patwardhan; Arman Cohan

arXiv:2603.20667·cs.SE·March 24, 2026

REVERE: Reflective Evolving Research Engineer for Scientific Workflows

Balaji Dinesh Gangireddi, Aniketh Garikaparthi, Manasi Patwardhan, Arman Cohan

PDF

Open Access

TL;DR

REVERE is a framework that enables scientific workflow agents to continually learn from experience, recognize recurring issues, and improve their performance by making targeted, knowledge-preserving edits across prompts and resources.

Contribution

It introduces a reflective, continual learning approach for research coding workflows that enhances performance by recognizing patterns and distilling reusable heuristics.

Findings

01

REVERE improves performance on research coding benchmarks by 3.5-4.9%.

02

It demonstrates the effectiveness of continual, reflective learning in scientific workflows.

03

Agents with REVERE evolve capabilities over time through global memory consolidation.

Abstract

Existing prompt-optimization techniques rely on local signals to update behavior, often neglecting broader and recurring patterns across tasks, leading to poor generalization; they further rely on full-prompt rewrites or unstructured merges, resulting in knowledge loss. These limitations are magnified in research-coding workflows, which involve heterogeneous repositories, underspecified environments, and weak feedback, where reproducing results from public codebases is an established evaluation regime. We introduce Reflective Evolving Research Engineer (REVERE), a framework that continuously learns from Global Training Context, recognizes recurring failure modes in cross-repository execution trajectories, distills them into reusable heuristics, and performs targeted edits across three configurable fields: the system prompt, a task-prompt template, and a cumulative cheatsheet. REVERE,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Distributed and Parallel Computing Systems