Reliability-Aware Adaptive Self-Consistency for Efficient Sampling in LLM Reasoning

Junseok Kim; Nakyeong Yang; Kyungmin Min; Kyomin Jung

arXiv:2601.02970·cs.CL·April 21, 2026

Reliability-Aware Adaptive Self-Consistency for Efficient Sampling in LLM Reasoning

Junseok Kim, Nakyeong Yang, Kyungmin Min, Kyomin Jung

PDF

TL;DR

ReASC is a novel method that enhances reasoning reliability in large language models by adaptively sampling responses based on evidence sufficiency, significantly reducing inference costs while maintaining accuracy.

Contribution

It introduces a reliability-aware adaptive sampling framework that improves efficiency by using response confidence and evidence sufficiency instead of count-based rules.

Findings

01

ReASC achieves the best accuracy-cost trade-off across five models and four datasets.

02

It reduces inference cost by up to 70% on GSM8K while preserving accuracy.

03

ReASC improves inference efficiency across model scales from 3B to 27B parameters.

Abstract

Self-Consistency improves reasoning reliability through multi-sample aggregation, but incurs substantial inference cost. Adaptive self-consistency methods mitigate this issue by adjusting the sampling budget; however, they rely on count-based stopping rules that treat all responses equally, often leading to unnecessary sampling. We propose Reliability-Aware Adaptive Self-Consistency (ReASC), which addresses this limitation by reframing adaptive sampling from response counting to evidence sufficiency, leveraging response-level confidence for principled information aggregation. ReASC operates in two stages: a single-sample decision stage that resolves instances confidently answerable from a single response, and a reliability-aware accumulation stage that aggregates responses by jointly leveraging their frequency and confidence. Across five models and four datasets, ReASC consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.