Manifold-based Sampling for In-Context Hallucination Detection in Large Language Models
Bodla Krishna Vamshi, Rohan Bhatnagar, Haizhao Yang

TL;DR
This paper introduces MB-ICL, a manifold-based demonstration sampling method that enhances in-context hallucination detection in large language models by leveraging latent representations and prototype geometry, outperforming traditional heuristics.
Contribution
The paper presents a novel manifold-based demonstration sampling framework for ICL that improves robustness and accuracy in hallucination detection without modifying LLM parameters.
Findings
MB-ICL outperforms standard ICL baselines on FEVER and HaluEval benchmarks.
The method shows strong gains in dialogue and summarization tasks.
MB-ICL remains robust under temperature perturbations and model variations.
Abstract
Large language models (LLMs) frequently generate factually incorrect or unsupported content, commonly referred to as hallucinations. Prior work has explored decoding strategies, retrieval augmentation, and supervised fine-tuning for hallucination detection, while recent studies show that in-context learning (ICL) can substantially influence factual reliability. However, existing ICL demonstration selection methods often rely on surface-level similarity heuristics and exhibit limited robustness across tasks and models. We propose MB-ICL, a manifold-based demonstration sampling framework for selecting in-context demonstrations that leverages latent representations extracted from frozen LLMs. By jointly modeling local manifold structure and class-aware prototype geometry, MB-ICL selects demonstrations based on their proximity to learned prototypes rather than lexical or embedding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Misinformation and Its Impacts · Topic Modeling
