Epistemic Blinding: An Inference-Time Protocol for Auditing Prior Contamination in LLM-Assisted Analysis
Michael Cuccarese

TL;DR
This paper introduces epistemic blinding, an inference-time protocol that enhances auditability of LLM outputs by distinguishing data-driven inference from model memorized priors, demonstrated in biological and financial contexts.
Contribution
It proposes a simple, open-source protocol for blind entity anonymization that allows measuring the influence of data versus model memory in LLM reasoning.
Findings
Blinding alters 16% of top-20 predictions in drug target prioritization.
In equity screening, brand bias reshapes 30-40% of top-20 rankings.
The protocol enables auditability without requiring deterministic reasoning.
Abstract
This paper presents epistemic blinding in the context of an agentic system that uses large language models to reason across multiple biological datasets for drug target prioritization. During development, it became apparent that LLM outputs silently blend data-driven inference with memorized priors about named entities - and the blend is invisible: there is no way to determine, from a single output, how much came from the data on the page and how much came from the model's training memory. Epistemic blinding is a simple inference-time protocol that replaces entity identifiers with anonymous codes before prompting, then compares outputs against an unblinded control. The protocol does not make LLM reasoning deterministic, but it restores one critical axis of auditability: measuring how much of an output came from the supplied data versus the model's parametric knowledge. The complete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
