Evolutionary System 2 Reasoning: An Empirical Proof

Zeyuan Ma; Wenqi Huang; Guo-Huan Song; Hongshu Guo; Sijie Ma; Zhiguang Cao; Yue-Jiao Gong

arXiv:2512.05760 · cs.AI · December 8, 2025


TL;DR

This paper introduces an evolutionary framework to enhance reasoning abilities in large language models, demonstrating that even weaker models can develop strong reasoning skills through evolutionary optimization.

Contribution

The paper proposes the ERO framework that evolves LLMs to improve their reasoning ability, showing that simple evolutionary strategies can significantly enhance weaker models.

Findings

01 GPT-5 shows limited reasoning ability

02 Weak models can be evolved to strong reasoners

03 Evolutionary optimization improves reasoning performance

Abstract

Machine intelligence marks the ultimate dream of making machines' intelligence comparable to that of human beings. While recent progress in Large Language Models (LLMs) shows substantial specific skills for a wide array of downstream tasks, they more or less fall short in general intelligence. Following the correlation between intelligence and System 2 reasoning (slow thinking), in this paper we aim to answer a worthwhile research question: could machine intelligence such as LLMs be evolved to acquire reasoning ability (not a specific skill) just like human beings? To this end, we propose the evolutionary reasoning optimization (ERO) framework, which performs survival of the fittest over a population of LLMs to search for an individual with strong reasoning ability. Given a reasoning task, ERO first initializes multiple LLMs as a population, after which an evolutionary strategy evolves the population…
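The loop the abstract describes (initialize a population, score each individual, keep the fittest, produce new candidates) can be illustrated with a generic elitist evolution strategy. The sketch below is not the paper's ERO implementation: the abstract is truncated before the details, so everything here is an assumption made for illustration. Each "individual" is a short list of floats standing in for LLM parameters, `fitness` is a toy score standing in for reasoning-task performance, and `evolve`, `pop_size`, `n_parents`, and `sigma` are hypothetical names.

```python
import random


def fitness(params, target):
    """Toy stand-in for a reasoning score (assumption): higher is better."""
    return -sum((p - t) ** 2 for p, t in zip(params, target))


def evolve(target, pop_size=20, n_parents=5, sigma=0.1, generations=200, seed=0):
    """Generic elitist evolution strategy; NOT the paper's ERO algorithm."""
    rng = random.Random(seed)
    dim = len(target)

    # Initialize a population of random individuals.
    population = [
        [rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(pop_size)
    ]
    history = []  # best fitness seen at each generation

    for _ in range(generations):
        # Survival of the fittest: rank individuals and keep the top few.
        population.sort(key=lambda p: fitness(p, target), reverse=True)
        history.append(fitness(population[0], target))
        parents = population[:n_parents]

        # Create children by adding Gaussian noise to a randomly chosen parent.
        children = [
            [gene + rng.gauss(0.0, sigma) for gene in rng.choice(parents)]
            for _ in range(pop_size - n_parents)
        ]

        # Parents survive unchanged, so the best score never gets worse.
        population = parents + children

    best = max(population, key=lambda p: fitness(p, target))
    history.append(fitness(best, target))
    return best, history


if __name__ == "__main__":
    best, history = evolve([1.0, -2.0, 0.5])
    print("first-generation best fitness:", history[0])
    print("final best fitness:", history[-1])
```

Because the parents are carried over unchanged each generation, the best fitness in this sketch can only stay the same or improve. Applying the same idea to real LLMs would require choosing what is mutated and how reasoning is scored, neither of which is specified in the visible part of the abstract.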


Taxonomy

Topics: Multimodal Machine Learning Applications · Topic Modeling · Language and cultural evolution