VERIFY-RL: Verifiable Recursive Decomposition for Reinforcement Learning in Mathematical Reasoning

Kaleem Ullah Qasim; Jiashu Zhang; Hao Li; Muhammad Kafeel Shaheen

arXiv:2602.07559·cs.AI·February 10, 2026

VERIFY-RL: Verifiable Recursive Decomposition for Reinforcement Learning in Mathematical Reasoning

Kaleem Ullah Qasim, Jiashu Zhang, Hao Li, Muhammad Kafeel Shaheen

PDF

Open Access

TL;DR

Verify-RL introduces a mathematically grounded, verifiable decomposition framework for reinforcement learning in mathematical reasoning, significantly improving accuracy by ensuring valid problem breakdowns.

Contribution

It presents a novel verification-based decomposition method using symbolic differentiation, ensuring valid subproblem generation with provable properties, unlike heuristic approaches.

Findings

01

Accuracy on hardest problems more than doubles from 32% to 68%.

02

Eliminating invalid decompositions improves overall performance.

03

Framework provides automatic verification through symbolic computation.

Abstract

Training language models to solve complex mathematical problems benefits from curriculum learning progressively training on simpler subproblems. However, existing decomposition methods are often heuristic, offering no guarantees that subproblems are simpler, that solving them aids the parent task, or that their relationships are mathematically grounded. We observe that symbolic differentiation provides a natural structure for verified decomposition: calculus rules explicitly define how expressions reduce to simpler components with provable properties. We introduce Verify-RL, a framework where every parent-child decomposition satisfies three verifiable conditions: strictly decreasing structural complexity, solution containment, and formal rule derivation. Unlike heuristic methods where a significant fraction of decompositions are invalid our properties admit automatic verification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Cognitive and developmental aspects of mathematical skills