Ask a Local: Detecting Hallucinations With Specialized Model Divergence
Aldan Creo, H\'ector Cerezo-Costas, Pedro Alonso-Doval, Maximiliano Hormaz\'abal-Lagos

TL;DR
The paper introduces 'Ask a Local', a multilingual hallucination detection method for LLMs that leverages divergence in specialized model perplexities, demonstrating consistent cross-lingual performance without needing adaptation or retraining.
Contribution
It presents a novel divergence-based approach for multilingual hallucination detection that scales efficiently across languages without requiring language-specific training.
Findings
Consistent IoU scores around 0.3 across 14 languages.
Strong performance on Italian and Catalan with IoU scores of 0.42 and 0.38.
Effective cross-lingual detection without language-specific adaptations.
Abstract
Hallucinations in large language models (LLMs) - instances where models generate plausible but factually incorrect information - present a significant challenge for AI. We introduce "Ask a Local", a novel hallucination detection method exploiting the intuition that specialized models exhibit greater surprise when encountering domain-specific inaccuracies. Our approach computes divergence between perplexity distributions of language-specialized models to identify potentially hallucinated spans. Our method is particularly well-suited for a multilingual context, as it naturally scales to multiple languages without the need for adaptation, relying on external data sources, or performing training. Moreover, we select computationally efficient models, providing a scalable solution that can be applied to a wide range of languages and domains. Our results on a human-annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Mental Health via Writing · Adversarial Robustness in Machine Learning
