Evolutionary Task Discovery: Advancing Reasoning Frontiers via Skill Composition and Complexity Scaling

Liqin Ye; Yanbin Yin; Michael Galarnyk; Yuzhao Heng; Sudheer Chava; Chao Zhang

arXiv:2605.11666·cs.LG·May 13, 2026

Evolutionary Task Discovery: Advancing Reasoning Frontiers via Skill Composition and Complexity Scaling

Liqin Ye, Yanbin Yin, Michael Galarnyk, Yuzhao Heng, Sudheer Chava, Chao Zhang

PDF

1 Repo

TL;DR

EvoTD introduces a structured evolutionary framework for data synthesis, enhancing reasoning capabilities of LLMs by systematically expanding task diversity and complexity.

Contribution

The paper presents EvoTD, a novel evolutionary approach with structured operators and a dynamic filter to improve reasoning by systematically generating diverse, complex tasks.

Findings

01

EvoTD achieves significant reasoning improvements across various models.

02

Structured evolutionary operators enhance task diversity and complexity.

03

Code is publicly available at https://github.com/liqinye/EvoTD.

Abstract

The reasoning frontier of Large Language Models (LLMs) has advanced significantly through modern post-training paradigms (e.g., Reinforcement Learning from Verifiable Rewards (RLVR)). However, the efficacy of these methods remains fundamentally constrained by the diversity and complexity of the training data. One practical solution is data synthesis; yet, prevalent methods relying on unstructured mutation or exploration suffer from homogeneity collapse, failing to systematically expand the reasoning frontier. To overcome this, we propose Evoutionary Task Discovery (EvoTD), a framework that treats data synthesis as a directed search over a dual-axis manifold of Algorithmic Skills and Complexity Attributes. We introduce structured evolutionary operators to navigate this space: a Crossover operator that synthesizes novel skill compositions to enhance diversity, and a Parametric Mutation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liqinye/EvoTD
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.