ThetaEvolve: Test-time Learning on Open Problems

Yiping Wang; Shao-Rong Su; Zhiyuan Zeng; Eva Xu; Liliang Ren; Xinyu Yang; Zeyi Huang; Xuehai He; Luyao Ma; Baolin Peng; Hao Cheng; Pengcheng He; Weizhu Chen; Shuohang Wang; Simon Shaolei Du; Yelong Shen

arXiv:2511.23473·cs.LG·December 1, 2025

ThetaEvolve: Test-time Learning on Open Problems

Yiping Wang, Shao-Rong Su, Zhiyuan Zeng, Eva Xu, Liliang Ren, Xinyu Yang, Zeyi Huang, Xuehai He, Luyao Ma, Baolin Peng, Hao Cheng, Pengcheng He, Weizhu Chen, Shuohang Wang, Simon Shaolei Du, Yelong Shen

PDF

Open Access

TL;DR

ThetaEvolve is an open-source framework that enables small language models to learn and improve solutions to open problems at test time through reinforcement learning, surpassing inference-only methods.

Contribution

It introduces a scalable, test-time learning framework for LLMs that allows continual improvement on open problems, extending prior closed-source approaches like AlphaEvolve.

Findings

01

ThetaEvolve achieves new best-known bounds on open problems.

02

RL at test time improves model performance and learning speed.

03

Models trained with RL outperform inference-only baselines.

Abstract

Recent advances in large language models (LLMs) have enabled breakthroughs in mathematical discovery, exemplified by AlphaEvolve, a closed-source system that evolves programs to improve bounds on open problems. However, it relies on ensembles of frontier LLMs to achieve new bounds and is a pure inference system that models cannot internalize the evolving strategies. We introduce ThetaEvolve, an open-source framework that simplifies and extends AlphaEvolve to efficiently scale both in-context learning and Reinforcement Learning (RL) at test time, allowing models to continually learn from their experiences in improving open optimization problems. ThetaEvolve features a single LLM, a large program database for enhanced exploration, batch sampling for higher throughput, lazy penalties to discourage stagnant outputs, and optional reward shaping for stable training signals, etc. ThetaEvolve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Materials Science