Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs

Paula Cordero-Encinar; Andrew B. Duncan

arXiv:2510.17472·stat.ML·October 24, 2025

Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs

Paula Cordero-Encinar, Andrew B. Duncan

PDF

Open Access 1 Models

TL;DR

This paper introduces a unified statistical framework for certifiable inference in large language models, connecting self-consistency and test-time reinforcement learning, and providing guarantees for reliable reasoning without extra supervision.

Contribution

It develops a certifiable inference framework, introduces the Martingale Majority Certificate, and links self-consistency with TTRL under a common statistical perspective.

Findings

01

Majority voting provides a high-probability certificate of self-consistency.

02

Finite-sample and anytime-valid bounds quantify confidence levels.

03

Post-training methods like TTRL sharpen answer distributions, reducing certification samples.

Abstract

Recent advances such as self-consistency and test-time reinforcement learning (TTRL) improve the reliability of large language models (LLMs) without additional supervision, yet their underlying mechanisms and statistical guarantees remain poorly understood. We present a unified framework for certifiable inference in LLMs, showing that majority voting provides a statistical certificate of self-consistency: under mild assumptions, the aggregated answer coincides with the mode of the model's terminal distribution with high probability. We derive finite-sample and anytime-valid concentration bounds that quantify this confidence, and introduce the Martingale Majority Certificate (MMC), a sequential stopping rule that adaptively determines when sufficient samples have been drawn. We further prove that label-free post-training methods such as TTRL implicitly sharpen the answer distribution by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
paulaoak/certified_self_consistency
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms