SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms

Alex Havrilla; Edward Hughes; Mikayel Samvelyan; Jacob Abernethy

arXiv:2506.06499·cs.LG·June 18, 2025

SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms

Alex Havrilla, Edward Hughes, Mikayel Samvelyan, Jacob Abernethy

PDF

Open Access

TL;DR

SPARQ introduces a novel method for generating high-quality, diverse synthetic math problems using a single model and quality-diversity algorithms, significantly enhancing reasoning capabilities and generalization of language models.

Contribution

The paper presents SPARQ, a new approach for scalable synthetic problem generation that improves model performance and generalization by filtering for difficulty and diversity.

Findings

01

Filtering by problem difficulty improves in-distribution performance.

02

Diverse synthetic data enhances out-of-distribution robustness.

03

Scaling laws exist for synthetic problems, benefiting model generalization.

Abstract

Large language model (LLM) driven synthetic data generation has emerged as a powerful method for improving model reasoning capabilities. However, most methods either distill large state-of-the-art models into small students or use natural ground-truth problem statements to guarantee problem statement quality. This limits the scalability of these approaches to more complex and diverse problem domains. To address this, we present SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms, a novel approach for generating high-quality and diverse synthetic math problem and solution pairs using only a single model by measuring a problem's solve-rate: a proxy for problem difficulty. Starting from a seed dataset of 7.5K samples, we generate over 20 million new problem-solution pairs. We show that filtering the generated data by difficulty and then fine-tuning the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · AI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge