LENS: Learning Ensemble Confidence from Neural States for Multi-LLM Answer Integration

Jizhou Guo

arXiv:2507.23167·cs.CL·August 1, 2025

LENS: Learning Ensemble Confidence from Neural States for Multi-LLM Answer Integration

Jizhou Guo

PDF

Open Access

TL;DR

LENS introduces a novel ensemble method that learns to estimate model confidence from internal neural states, improving multi-LLM answer integration without modifying models or adding significant computation.

Contribution

It proposes a lightweight confidence predictor using internal representations to enhance ensemble performance of multiple LLMs.

Findings

01

LENS outperforms traditional ensemble methods on question-answering tasks.

02

Internal neural states provide valuable signals for confidence estimation.

03

The method requires negligible additional computation.

Abstract

Large Language Models (LLMs) have demonstrated impressive performance across various tasks, with different models excelling in distinct domains and specific abilities. Effectively combining the predictions of multiple LLMs is crucial for enhancing system robustness and performance. However, existing ensemble methods often rely on simple techniques like voting or logits ensembling, which overlook the varying confidence and reliability of models in different contexts. In this work, we propose LENS (Learning ENsemble confidence from Neural States), a novel approach that learns to estimate model confidence by analyzing internal representations. For each LLM, we train a lightweight linear confidence predictor that leverages layer-wise hidden states and normalized probabilities as inputs. This allows for more nuanced weighting of model predictions based on their context-dependent reliability.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques