Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

Yongxian Wei; Yilin Zhao; Zixuan Hu; Li Shen; Xinrui Chen; Runxi Cheng; Sinan Du; Hao Yu; Chun Yuan; Dian Li

arXiv:2511.09907·cs.AI·May 11, 2026

TL;DR

This paper introduces a reasoning-driven, solver-adaptive data synthesis method for training reasoning models, improving problem quality and difficulty calibration across multiple benchmarks.

Contribution

It presents a novel problem generator that explicitly reasons about problem directions and adapts difficulty based on solver feedback, enhancing data quality for reasoning models.

Findings

01. Achieves a 3.4% average improvement on 10 reasoning benchmarks.

02. Effectively calibrates problem difficulty to the solver's ability.

03. Demonstrates robust generalization across language and vision-language models.

Abstract

Data synthesis for training large reasoning models offers a scalable alternative to limited, human-curated datasets, enabling the creation of high-quality data. However, existing approaches face several challenges: (i) indiscriminate generation that ignores the solver's ability and yields low-value problems, or reliance on complex data pipelines to balance problem difficulty; and (ii) a lack of reasoning in problem generation, leading to shallow problem variants. In this paper, we develop a problem generator that reasons explicitly to plan problem directions before synthesis and adapts difficulty to the solver's ability. Specifically, we construct related problem pairs and augment them with intermediate problem-design CoT produced by a reasoning model. These data are used to bootstrap problem-design strategies in the generator. Then, we treat the solver's feedback on synthetic problems…
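
As a rough illustration of what adapting difficulty to the solver's ability could look like, the sketch below scores a synthetic problem by how often a solver answers it correctly. This is not the paper's method: the abstract is truncated before it says how solver feedback is used, so the function names (`pass_rate`, `calibration_score`), the exact-match check, the sample count `k`, and the choice to favor problems solved about half the time are all assumptions made for this example.

```python
from typing import Callable


def pass_rate(solver: Callable[[str], str], problem: str,
              reference: str, k: int = 8) -> float:
    """Fraction of k solver attempts whose answer matches the reference.

    `solver` is a hypothetical callable mapping a problem string to an
    answer string; matching is plain string equality after stripping.
    """
    hits = sum(solver(problem).strip() == reference.strip() for _ in range(k))
    return hits / k


def calibration_score(p: float) -> float:
    """Hypothetical difficulty score for a pass rate p in [0, 1].

    Peaks at 1.0 when the solver succeeds half the time and drops to 0.0
    when the problem is always solved (too easy) or never solved (too hard).
    """
    return 1.0 - abs(2.0 * p - 1.0)


if __name__ == "__main__":
    # Toy solver that always answers "4" (assumption for demonstration only).
    always_four = lambda question: "4"
    p = pass_rate(always_four, "What is 2 + 2?", "4", k=4)
    print(p, calibration_score(p))
```

Under these assumptions, a problem the solver always gets right or always gets wrong receives a score of zero, which matches the abstract's concern that problems ignoring the solver's ability carry little training value.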
