Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents

Jonas Karge

arXiv:2602.22413·cs.AI·April 2, 2026

Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents

Jonas Karge

PDF

TL;DR

This paper introduces a probabilistic framework where agents learn their reliability and abstain when unsure, enhancing collective decision accuracy beyond classical voting theorems, with applications to AI safety.

Contribution

It generalizes the Condorcet Jury Theorem to a sequential, confidence-based setting allowing agents to abstain, supported by theoretical bounds and empirical validation.

Findings

01

Derived a lower bound on group success probability with selective participation.

02

Proved that confidence gating extends asymptotic guarantees of classical voting theorems.

03

Validated bounds through Monte Carlo simulations.

Abstract

We investigate the collective accuracy of heterogeneous agents who learn to estimate their own reliability over time and selectively abstain from voting. While classical epistemic voting results, such as the \textit{Condorcet Jury Theorem} (CJT), assume fixed participation, real-world aggregation often benefits from allowing agents to say ``I don't know.'' We propose a probabilistic framework where agents engage in a \textit{calibration} phase, updating beliefs about their own fixed competence, before facing a final confidence gate that determines whether to vote or abstain. We derive a non-asymptotic lower bound on the group's success probability and prove that this \textit{selective participation} generalizes the asymptotic guarantees of the CJT to a sequential, confidence-gated setting. Empirically, we validate these bounds via Monte Carlo simulations. While our results are general,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.