Learning to Trust the Crowd: A Multi-Model Consensus Reasoning Engine for Large Language Models

Pranav Kallem

arXiv:2601.07245·cs.AI·January 13, 2026

Learning to Trust the Crowd: A Multi-Model Consensus Reasoning Engine for Large Language Models

Pranav Kallem

PDF

Open Access

TL;DR

This paper introduces a multi-model consensus reasoning engine that combines responses from various large language models to improve answer accuracy and reliability through supervised learning techniques.

Contribution

It presents a novel supervised meta-learner that integrates diverse features from multiple LLM outputs to enhance correctness and reduce hallucinations.

Findings

01

Improves accuracy by 4.6 percentage points over the best single LLM.

02

Reduces hallucinations and Brier scores.

03

Semantic agreement features are most influential.

Abstract

Large language models (LLMs) achieve strong average performance yet remain unreliable at the instance level, with frequent hallucinations, brittle failures, and poorly calibrated confidence. We study reliability through the lens of multi-model consensus: given responses from several heterogeneous LLMs, can we learn which answer is most likely correct for a given query? We introduce a Multi-Model Consensus Reasoning Engine that treats the set of LLM outputs as input to a supervised meta-learner. The system maps natural language responses into structured features using semantic embeddings, pairwise similarity and clustering statistics, lexical and structural cues, reasoning-quality scores, confidence estimates, and model-specific priors, and then applies gradient-boosted trees, listwise ranking, and graph neural networks over similarity graphs of answers. Using three open-weight LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)