Manifold-based Sampling for In-Context Hallucination Detection in Large Language Models

Bodla Krishna Vamshi; Rohan Bhatnagar; Haizhao Yang

arXiv:2601.06196·cs.LG·January 22, 2026

Open Access

TL;DR

This paper introduces MB-ICL, a manifold-based demonstration sampling method for in-context hallucination detection in large language models. It selects demonstrations using latent representations and prototype geometry, and it outperforms standard ICL baselines on the FEVER and HaluEval benchmarks.

Contribution

The paper presents a novel manifold-based demonstration sampling framework for ICL that improves robustness and accuracy in hallucination detection without modifying LLM parameters.

Findings

01 MB-ICL outperforms standard ICL baselines on FEVER and HaluEval benchmarks.

02 The method shows strong gains in dialogue and summarization tasks.

03 MB-ICL remains robust under temperature perturbations and model variations.

Abstract

Large language models (LLMs) frequently generate factually incorrect or unsupported content, commonly referred to as hallucinations. Prior work has explored decoding strategies, retrieval augmentation, and supervised fine-tuning for hallucination detection, while recent studies show that in-context learning (ICL) can substantially influence factual reliability. However, existing ICL demonstration selection methods often rely on surface-level similarity heuristics and exhibit limited robustness across tasks and models. We propose MB-ICL, a manifold-based demonstration sampling framework for selecting in-context demonstrations that leverages latent representations extracted from frozen LLMs. By jointly modeling local manifold structure and class-aware prototype geometry, MB-ICL selects demonstrations based on their proximity to learned prototypes rather than lexical or embedding…
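
The prototype-proximity idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how prototypes are learned or how local manifold structure enters the selection, so the sketch below makes simplifying assumptions: each class prototype is the mean of that class's embeddings, and proximity is Euclidean distance. The function names and the toy vectors are hypothetical and are not the authors' implementation.

```python
import math
from collections import defaultdict


def class_prototypes(embeddings, labels):
    """Return one prototype per class.

    Assumption: a prototype is the per-dimension mean of the class's
    embeddings. This is a stand-in for the learned prototypes in the paper.
    """
    groups = defaultdict(list)
    for vec, lab in zip(embeddings, labels):
        groups[lab].append(vec)
    return {
        lab: [sum(col) / len(vecs) for col in zip(*vecs)]
        for lab, vecs in groups.items()
    }


def select_demonstrations(embeddings, labels, k_per_class=1):
    """Pick, for each class, the k pool items closest to that class's prototype.

    Returns indices into the pool, grouped by class in sorted label order.
    """
    prototypes = class_prototypes(embeddings, labels)
    selected = []
    for lab in sorted(prototypes):
        members = [i for i, l in enumerate(labels) if l == lab]
        # Sort class members by Euclidean distance to their prototype.
        members.sort(key=lambda i: math.dist(embeddings[i], prototypes[lab]))
        selected.extend(members[:k_per_class])
    return selected


# Toy pool: in practice these would be latent representations taken
# from a frozen LLM. Label 0 and label 1 are placeholder classes.
pool = [[0.0, 0.0], [0.2, 0.0], [1.0, 0.0], [5.0, 5.0], [5.2, 5.0], [9.0, 5.0]]
pool_labels = [0, 0, 0, 1, 1, 1]

chosen = select_demonstrations(pool, pool_labels, k_per_class=1)
print(chosen)
```

The selection here depends only on the demonstration pool and not on the query, which is one possible reading of "proximity to learned prototypes"; a query-aware variant would be an equally plausible reading of the truncated abstract.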

Taxonomy

Topics: Mental Health via Writing · Misinformation and Its Impacts · Topic Modeling