Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment
Xinyu Zhang

TL;DR
This paper introduces NSRSA, a method that stabilizes recursive self-training by integrating symbolic verification to ensure reasoning quality, thereby reducing errors and improving model performance in iterative self-improvement tasks.
Contribution
NSRSA is the first framework to embed symbolic verification into recursive self-training, effectively filtering flawed reasoning and enhancing model reliability.
Findings
NSRSA rejects 34% of solutions with flawed reasoning.
Constructing DPO preferences from NSRSA improves reward accuracy from 46% to 63%.
NSRSA reduces the impact of lucky guesses in self-training data.
Abstract
Recursive self-improvement--where a model iteratively trains on its own outputs--promises sustained capability growth but faces a fundamental obstacle: recursive drift. As models train on self-generated data across multiple iterations, errors in intermediate reasoning compound, leading to mode collapse and performance degradation. We propose Neuro-Symbolic Recursive Self-Alignment (NSRSA), which stabilizes iterative self-training by embedding a symbolic verification subsystem that gates training data quality at the reasoning step level. Unlike outcome-only filtering (which admits "lucky guesses" with flawed reasoning), NSRSA verifies each arithmetic operation via sympy, checks logical flow consistency across reasoning steps, and enforces domain constraints. We evaluate NSRSA on GSM8K using Qwen3-4B-Thinking across 5 self-training iterations under five conditions: no verification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
