TL;DR
EvoTD introduces a structured evolutionary framework for data synthesis, enhancing reasoning capabilities of LLMs by systematically expanding task diversity and complexity.
Contribution
The paper presents EvoTD, a novel evolutionary approach with structured operators and a dynamic filter to improve reasoning by systematically generating diverse, complex tasks.
Findings
EvoTD achieves significant reasoning improvements across various models.
Structured evolutionary operators enhance task diversity and complexity.
Code is publicly available at https://github.com/liqinye/EvoTD.
Abstract
The reasoning frontier of Large Language Models (LLMs) has advanced significantly through modern post-training paradigms (e.g., Reinforcement Learning from Verifiable Rewards (RLVR)). However, the efficacy of these methods remains fundamentally constrained by the diversity and complexity of the training data. One practical solution is data synthesis; yet, prevalent methods relying on unstructured mutation or exploration suffer from homogeneity collapse, failing to systematically expand the reasoning frontier. To overcome this, we propose Evoutionary Task Discovery (EvoTD), a framework that treats data synthesis as a directed search over a dual-axis manifold of Algorithmic Skills and Complexity Attributes. We introduce structured evolutionary operators to navigate this space: a Crossover operator that synthesizes novel skill compositions to enhance diversity, and a Parametric Mutation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
