Refining Answer Distributions for Improved Large Language Model   Reasoning

Soumyasundar Pal; Didier Ch\'etelat; Yingxue Zhang; Mark Coates

arXiv:2412.13292·cs.CL·April 11, 2025

Refining Answer Distributions for Improved Large Language Model Reasoning

Soumyasundar Pal, Didier Ch\'etelat, Yingxue Zhang, Mark Coates

PDF

Open Access

TL;DR

This paper introduces Refined Answer Distributions, a new iterative sampling method that improves large language model reasoning by better combining multiple responses to identify the most probable answer.

Contribution

It proposes a novel algorithmic framework that enhances LLM reasoning by iteratively approximating answer distributions to find the most likely solution.

Findings

01

Outperforms existing combination strategies like self-consistency.

02

Effectively identifies the mode of answer distributions.

03

Demonstrates superior performance on reasoning benchmarks.

Abstract

Large Language Models (LLMs) have exhibited an impressive capability to perform reasoning tasks, especially if they are encouraged to generate a sequence of intermediate steps. Reasoning performance can be improved by suitably combining multiple LLM responses, generated either in parallel in a single query, or via sequential interactions with LLMs throughout the reasoning process. Existing strategies for combination, such as self-consistency and progressive-hint-prompting, make inefficient usage of the LLM responses. We present Refined Answer Distributions, a novel and principled algorithmic framework to enhance the reasoning capabilities of LLMs. Our approach can be viewed as an iterative sampling strategy for forming a Monte Carlo approximation of an underlying distribution of answers, with the goal of identifying the mode -- the most likely answer. Empirical evaluation on several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsHierarchical Information Threading