Investigating Hallucination in Conversations for Low Resource Languages

Amit Das; Md. Najib Hasan; Souvika Sarkar; Zheng Zhang; Fatemeh Jamshidi; Tathagata Bhattacharya; Nilanjana Raychawdhury; Dongji Feng; Vinija Jain; Aman Chadha

arXiv:2507.22720 · cs.CL · November 20, 2025

TL;DR

This paper investigates hallucination issues in conversational LLMs across Hindi, Farsi, and Mandarin, revealing language-dependent differences in factual accuracy and analyzing multiple models' performance.

Contribution

The paper extends hallucination analysis to conversational data in Hindi, Farsi, and Mandarin, offering a comprehensive analysis of a dataset and a comparison across six LLMs.

Findings

01 Fewer hallucinations in Mandarin responses

02 Higher hallucination rates in Hindi and Farsi

03 Model performance varies significantly by language

Abstract

Large Language Models (LLMs) have demonstrated remarkable proficiency in generating text that closely resembles human writing. However, they often generate factually incorrect statements, a problem typically referred to as 'hallucination'. Addressing hallucination is crucial for enhancing the reliability and effectiveness of LLMs. While much research has focused on hallucinations in English, our study extends this investigation to conversational data in three languages: Hindi, Farsi, and Mandarin. We offer a comprehensive analysis of a dataset to examine both factual and linguistic errors in these languages for GPT-3.5, GPT-4o, Llama-3.1, Gemma-2.0, DeepSeek-R1, and Qwen-3. We found that LLMs produce very few hallucinated responses in Mandarin but generate a significantly higher number of hallucinations in Hindi and Farsi.
