Ask a Local: Detecting Hallucinations With Specialized Model Divergence

Aldan Creo; H\'ector Cerezo-Costas; Pedro Alonso-Doval; Maximiliano Hormaz\'abal-Lagos

arXiv:2506.03357·cs.CL·June 5, 2025

Ask a Local: Detecting Hallucinations With Specialized Model Divergence

Aldan Creo, H\'ector Cerezo-Costas, Pedro Alonso-Doval, Maximiliano Hormaz\'abal-Lagos

PDF

Open Access

TL;DR

The paper introduces 'Ask a Local', a multilingual hallucination detection method for LLMs that leverages divergence in specialized model perplexities, demonstrating consistent cross-lingual performance without needing adaptation or retraining.

Contribution

It presents a novel divergence-based approach for multilingual hallucination detection that scales efficiently across languages without requiring language-specific training.

Findings

01

Consistent IoU scores around 0.3 across 14 languages.

02

Strong performance on Italian and Catalan with IoU scores of 0.42 and 0.38.

03

Effective cross-lingual detection without language-specific adaptations.

Abstract

Hallucinations in large language models (LLMs) - instances where models generate plausible but factually incorrect information - present a significant challenge for AI. We introduce "Ask a Local", a novel hallucination detection method exploiting the intuition that specialized models exhibit greater surprise when encountering domain-specific inaccuracies. Our approach computes divergence between perplexity distributions of language-specialized models to identify potentially hallucinated spans. Our method is particularly well-suited for a multilingual context, as it naturally scales to multiple languages without the need for adaptation, relying on external data sources, or performing training. Moreover, we select computationally efficient models, providing a scalable solution that can be applied to a wide range of languages and domains. Our results on a human-annotated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mental Health via Writing · Adversarial Robustness in Machine Learning