Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving

Shuo Xu; Jiakun Zhang; Junyu Lai; Chun Cao; Jingwei Xu

arXiv:2605.11905·cs.AI·May 13, 2026

Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving

Shuo Xu, Jiakun Zhang, Junyu Lai, Chun Cao, Jingwei Xu

PDF

1 Repo

TL;DR

This paper introduces segment-level supervision for training LLM-based theorem provers, improving proof success rates by better aligning training data granularity with proof structure.

Contribution

It proposes a novel segment-level supervision strategy that enhances proof success and inference efficiency over traditional step-level or whole-proof methods.

Findings

01

Segment-level supervision achieves higher proof success rates on multiple datasets.

02

Goal-aware rollouts improve existing provers' success rates and reduce inference costs.

03

Code and models are publicly available at the provided GitHub repository.

Abstract

Automated theorem proving with large language models in Lean 4 is commonly approached through either step-level tactic prediction with tree search or whole-proof generation. These two paradigms represent opposite granularities for constructing supervised training data: the former provides dense local signals but may fragment coherent proof processes, while the latter preserves global structure but requires complex end-to-end generation. In this paper, we revisit supervision granularity as a training set construction problem over proof trajectories and propose segment-level supervision, a training data construction strategy that extracts locally coherent proof segments for training policy models. We further reuse the same strategy at inference time to trigger short rollouts for existing step-level models. When trained with segment-level supervision on STP, LeanWorkbook, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NJUDeepEngine/SEG-ATP
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.