DSC2025 -- ViHallu Challenge: Detecting Hallucination in Vietnamese LLMs
Anh Thi-Hoang Nguyen, Khanh Quoc Tran, Tin Van Huynh, Phuoc Tan-Hoang Nguyen, Cam Tan Nguyen, Kiet Van Nguyen

TL;DR
This paper introduces the first large-scale benchmark for detecting hallucinations in Vietnamese LLMs, providing a dataset, challenge, and analysis to improve model reliability in low-resource language settings.
Contribution
It presents the ViHallu dataset and the DSC2025 ViHallu Challenge, establishing a new benchmark for hallucination detection in Vietnamese LLMs and analyzing diverse detection methods.
Findings
Best system achieved 84.80% macro-F1 score
Instruction-tuned LLMs outperform generic models
Detection of intrinsic hallucinations remains challenging
Abstract
The reliability of large language models (LLMs) in production environments remains significantly constrained by their propensity to generate hallucinations -- fluent, plausible-sounding outputs that contradict or fabricate information. While hallucination detection has recently emerged as a priority in English-centric benchmarks, low-to-medium resource languages such as Vietnamese remain inadequately covered by standardized evaluation frameworks. This paper introduces the DSC2025 ViHallu Challenge, the first large-scale shared task for detecting hallucinations in Vietnamese LLMs. We present the ViHallu dataset, comprising 10,000 annotated triplets of (context, prompt, response) samples systematically partitioned into three hallucination categories: no hallucination, intrinsic, and extrinsic hallucinations. The dataset incorporates three prompt types -- factual, noisy, and adversarial --…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Adversarial Robustness in Machine Learning · Psychedelics and Drug Studies
