From Implicit to Explicit: Token-Efficient Logical Supervision for Mathematical Reasoning in LLMs

Shaojie Wang; Liang Zhang

arXiv:2601.03682·cs.CL·April 21, 2026

From Implicit to Explicit: Token-Efficient Logical Supervision for Mathematical Reasoning in LLMs

Shaojie Wang, Liang Zhang

PDF

TL;DR

This paper introduces FSLR, a lightweight training framework that explicitly teaches logical relationship understanding to improve mathematical reasoning in LLMs, outperforming prior methods and reducing training costs.

Contribution

FSLR is a novel approach that explicitly supervises the first planning step in logical reasoning, addressing a key limitation of existing fine-tuning methods.

Findings

01

FSLR outperforms CoT-SFT with 3.2% and 4.6% accuracy improvements.

02

FSLR reduces training token consumption by over 80%.

03

FSLR achieves 4-6x faster training.

Abstract

Recent studies reveal that large language models (LLMs) exhibit limited logical reasoning abilities in mathematical problem-solving, instead often relying on pattern-matching and memorization. We systematically analyze this limitation, focusing on logical relationship understanding, which is a core capability underlying genuine logical reasoning, and reveal that errors related to this capability account for over 90\% of incorrect predictions, with Chain-of-Thought Supervised Fine-Tuning (CoT-SFT) failing to substantially reduce these errors. To address this bottleneck, we propose First-Step Logical Reasoning (FSLR), a lightweight training framework targeting logical relationship understanding. Our key insight is that the first planning step-identifying which variables to use and which operation to apply-encourages the model to derive logical relationships directly from the problem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.