Learning to Trust the Crowd: A Multi-Model Consensus Reasoning Engine for Large Language Models
Pranav Kallem

TL;DR
This paper introduces a multi-model consensus reasoning engine that combines responses from various large language models to improve answer accuracy and reliability through supervised learning techniques.
Contribution
It presents a novel supervised meta-learner that integrates diverse features from multiple LLM outputs to enhance correctness and reduce hallucinations.
Findings
Improves accuracy by 4.6 percentage points over the best single LLM.
Reduces hallucinations and Brier scores.
Semantic agreement features are most influential.
Abstract
Large language models (LLMs) achieve strong average performance yet remain unreliable at the instance level, with frequent hallucinations, brittle failures, and poorly calibrated confidence. We study reliability through the lens of multi-model consensus: given responses from several heterogeneous LLMs, can we learn which answer is most likely correct for a given query? We introduce a Multi-Model Consensus Reasoning Engine that treats the set of LLM outputs as input to a supervised meta-learner. The system maps natural language responses into structured features using semantic embeddings, pairwise similarity and clustering statistics, lexical and structural cues, reasoning-quality scores, confidence estimates, and model-specific priors, and then applies gradient-boosted trees, listwise ranking, and graph neural networks over similarity graphs of answers. Using three open-weight LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
