Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis

Zhengbo Jiao; Shaobo Wang; Zifan Zhang; Xuan Ren; Wei Wang; Bing Zhao; Hu Wei; Linfeng Zhang

arXiv:2602.03279·cs.AI·February 4, 2026

Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis

Zhengbo Jiao, Shaobo Wang, Zifan Zhang, Xuan Ren, Wei Wang, Bing Zhao, Hu Wei, Linfeng Zhang

PDF

Open Access

TL;DR

This paper introduces Agentic Proposing, a novel framework for synthesizing high-quality, verifiable training data for large language models by dynamically composing modular reasoning skills, leading to significant improvements in reasoning tasks.

Contribution

It presents a goal-driven, agent-based approach to problem synthesis that balances complexity and validity, enabling the creation of effective training data with fewer instances.

Findings

01

Models trained on synthesized data outperform baselines.

02

Achieves 91.6% accuracy on AIME25 with only 11,000 trajectories.

03

Synthetic data rivals proprietary models like GPT-5.

Abstract

Advancing complex reasoning in large language models relies on high-quality, verifiable datasets, yet human annotation remains cost-prohibitive and difficult to scale. Current synthesis paradigms often face a recurring trade-off: maintaining structural validity typically restricts problem complexity, while relaxing constraints to increase difficulty frequently leads to inconsistent or unsolvable instances. To address this, we propose Agentic Proposing, a framework that models problem synthesis as a goal-driven sequential decision process where a specialized agent dynamically selects and composes modular reasoning skills. Through an iterative workflow of internal reflection and tool-use, we develop the Agentic-Proposer-4B using Multi-Granularity Policy Optimization (MGPO) to generate high-precision, verifiable training trajectories across mathematics, coding, and science. Empirical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Machine Learning in Materials Science