VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection
James Petullo, Sonny George, Dylan Cashman, Nianwen Xue

TL;DR
VecCISC enhances confidence-informed self-consistency in reasoning tasks by clustering reasoning traces to reduce computational costs while maintaining high accuracy across diverse datasets.
Contribution
It introduces a semantic similarity-based filtering method to reduce the number of reasoning traces evaluated, lowering token usage without sacrificing accuracy.
Findings
VecCISC reduces token usage by 47%.
Maintains or exceeds CISC accuracy across five datasets.
Effective across domains like mathematics, chemistry, and humanities.
Abstract
A standard technique for scaling inference-time reasoning is Self-Consistency, whereby multiple candidate answers are sampled from an LLM and the most common answer is selected. More recently, it has been shown that weighted majority voting (e.g. Confidence-Informed Self Consistency (CISC)), which assigns a confidence value to each candidate answer and chooses the answer with the largest accumulated score, tends to be more accurate on a wide range of popular benchmarks. In practice, weighted majority voting necessitates calling a critic LLM on each candidate's reasoning trace to produce the answer's confidence score. This secondary series of LLM calls greatly increases the overhead and cost of weighted majority voting, despite its potential performance benefits. To reduce this expense, we propose VecCISC, a lightweight, adaptive framework that uses a measure of semantic similarity to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
