Adaptive Problem Generation via Symbolic Representations
Teresa Yeo, Myeongho Jeon, Dulaj Weerakoon, Rui Qiao, Alok Prakash, Armando Solar-Lezama, Archan Misra

TL;DR
This paper introduces a symbolic, adaptive problem generation method for reinforcement learning that enhances small language models' mathematical reasoning by controlling problem structure and difficulty.
Contribution
It proposes a symbolic problem space for precise control and an adaptive framework that learns to modify problems, improving mathematical reasoning in language models.
Findings
Adaptive problem generation improves model performance.
Symbolic representations enable diverse and controlled problem creation.
Learning modification strategies enhances math-solving ability.
Abstract
We present a method for generating training data for reinforcement learning with verifiable rewards to improve small open-weights language models on mathematical tasks. Existing data generation approaches rely on open-loop pipelines and fixed modifications that do not adapt to the model's capabilities. Furthermore, they typically operate directly on word problems, limiting control over problem structure. To address this, we perform modifications in a symbolic problem space, representing each problem as a set of symbolic variables and constraints (e.g., via algebraic frameworks such as SymPy or SMT formulations). This representation enables precise control over problem structure, automatic generation of ground-truth solutions, and decouples mathematical reasoning from linguistic realization. We also show that this results in more diverse generations. To adapt the problem difficulty to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Topic Modeling · Multimodal Machine Learning Applications
