Bridging Internal Probability and Self-Consistency for Effective and   Efficient LLM Reasoning

Zhi Zhou; Tan Yuhao; Zenan Li; Yuan Yao; Lan-Zhe Guo; Xiaoxing Ma,; Yu-Feng Li

arXiv:2502.00511·cs.LG·February 14, 2025

Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning

Zhi Zhou, Tan Yuhao, Zenan Li, Yuan Yao, Lan-Zhe Guo, Xiaoxing Ma,, Yu-Feng Li

PDF

Open Access

TL;DR

This paper introduces a theoretical analysis of reasoning techniques in large language models, identifies their limitations, and proposes RPC, a method that improves reasoning accuracy and efficiency by combining perplexity and self-consistency.

Contribution

It provides the first theoretical error decomposition of perplexity and self-consistency methods and introduces RPC, a novel approach that enhances reasoning performance and sample efficiency.

Findings

01

RPC accelerates convergence of estimation error

02

RPC reduces model error effectively

03

Empirical results show improved reasoning accuracy

Abstract

Recent advancements in large language models (LLMs) have demonstrated remarkable reasoning capabilities. However, single-shot inference often yields unreliable results for complex reasoning tasks, leading researchers to explore multiple reasoning paths through methods such as perplexity and self-consistency. In this paper, we present the first theoretical error decomposition analysis of these techniques, breaking down their error into estimation error and model error. Our analysis reveals a fundamental trade-off: perplexity methods suffer from substantial model error due to the absence of a proper consistency function, while self-consistency exhibits high estimation error due to a slow error convergence rate. To overcome these limitations, we propose Reasoning-Pruning Perplexity Consistency (RPC). This approach combines Perplexity Consistency, which seamlessly integrates LLM perplexity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies

MethodsPruning