VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

James Petullo; Sonny George; Dylan Cashman; Nianwen Xue

arXiv:2605.08070·cs.AI·May 11, 2026

VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

James Petullo, Sonny George, Dylan Cashman, Nianwen Xue

PDF

1 Datasets

TL;DR

VecCISC enhances confidence-informed self-consistency in reasoning tasks by clustering reasoning traces to reduce computational costs while maintaining high accuracy across diverse datasets.

Contribution

It introduces a semantic similarity-based filtering method to reduce the number of reasoning traces evaluated, lowering token usage without sacrificing accuracy.

Findings

01

VecCISC reduces token usage by 47%.

02

Maintains or exceeds CISC accuracy across five datasets.

03

Effective across domains like mathematics, chemistry, and humanities.

Abstract

A standard technique for scaling inference-time reasoning is Self-Consistency, whereby multiple candidate answers are sampled from an LLM and the most common answer is selected. More recently, it has been shown that weighted majority voting (e.g. Confidence-Informed Self Consistency (CISC)), which assigns a confidence value to each candidate answer and chooses the answer with the largest accumulated score, tends to be more accurate on a wide range of popular benchmarks. In practice, weighted majority voting necessitates calling a critic LLM on each candidate's reasoning trace to produce the answer's confidence score. This secondary series of LLM calls greatly increases the overhead and cost of weighted majority voting, despite its potential performance benefits. To reduce this expense, we propose VecCISC, a lightweight, adaptive framework that uses a measure of semantic similarity to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Daybreak159/arxivclaw-paper-link-test-2605-08070-20260513-202923
dataset· 37 dl
37 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.