PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

Minghao Yan; Bo Peng; Benjamin Coleman; Ziqi Chen; Zhouhang Xie; Shuo Chen; Zhankui He; Noveen Sachdeva; Weili Wang; Ed H. Chi; Shivaram Venkataraman; Wang-Cheng Kang; Derek Zhiyuan Cheng; Beidou Wang

arXiv:2605.07039·cs.LG·May 11, 2026

PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

Minghao Yan, Bo Peng, Benjamin Coleman, Ziqi Chen, Zhouhang Xie, Shuo Chen, Zhankui He, Noveen Sachdeva, Weili Wang, Ed H. Chi, Shivaram Venkataraman, Wang-Cheng Kang, Derek Zhiyuan Cheng, Beidou Wang

PDF

TL;DR

PACEvolve++ introduces a reinforcement learning framework that enables test-time policy adaptation for evolutionary search agents, improving convergence speed and stability across various tasks.

Contribution

It proposes a novel advisor-model RL framework with phase-adaptive training to enhance evolutionary search performance.

Findings

01

Outperforms state-of-the-art frameworks in convergence speed.

02

Stabilizes test-time training during evolutionary search.

03

Effective across diverse tasks like load balancing and protein fitness.

Abstract

Large language models have become drivers of evolutionary search, but most systems rely on a fixed, prompt-elicited policy to sample next candidates. This limits adaptation in practical engineering and research tasks, where evaluations are expensive, and progress depends on learning task-specific search dynamics. We introduce PACEvolve++, an advisor-model reinforcement learning framework for test-time policy adaptation in evolutionary search agents. PACEvolve++ decouples strategic search decisions from implementation: a trainable advisor generates, assesses, and selects hypotheses, while a stronger frontier model translates selected hypotheses into executable candidates. To train the advisor under non-stationary feedback, we propose a phase-adaptive approach that adapts its optimization strategy to different phases of the evolutionary process. Early in evolution, it uses group-relative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.