Medico: Towards Hallucination Detection and Correction with Multi-source   Evidence Fusion

Xinping Zhao; Jindi Yu; Zhenyu Liu; Jifang Wang; Dongfang Li; Yibin; Chen; Baotian Hu; Min Zhang

arXiv:2410.10408·cs.CL·October 15, 2024

Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion

Xinping Zhao, Jindi Yu, Zhenyu Liu, Jifang Wang, Dongfang Li, Yibin, Chen, Baotian Hu, Min Zhang

PDF

Open Access 1 Video

TL;DR

Medico is an automated framework that detects and corrects hallucinations in LLM outputs by fusing multiple sources of evidence, significantly improving factual accuracy and providing explanations.

Contribution

It introduces a novel multi-source evidence fusion approach for automatic hallucination detection and correction in LLMs, enhancing factual reliability.

Findings

01

High accuracy in evidence retrieval (HR@5 0.964)

02

Effective hallucination detection (F1 0.927-0.951)

03

Successful content correction (approval rate 0.973-0.979)

Abstract

As we all know, hallucinations prevail in Large Language Models (LLMs), where the generated content is coherent but factually incorrect, which inflicts a heavy blow on the widespread application of LLMs. Previous studies have shown that LLMs could confidently state non-existent facts rather than answering ``I don't know''. Therefore, it is necessary to resort to external knowledge to detect and correct the hallucinated content. Since manual detection and correction of factual errors is labor-intensive, developing an automatic end-to-end hallucination-checking approach is indeed a needful thing. To this end, we present Medico, a Multi-source evidence fusion enhanced hallucination detection and correction framework. It fuses diverse evidence from multiple sources, detects whether the generated content contains factual errors, provides the rationale behind the judgment, and iteratively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion· underline

Taxonomy

TopicsTraditional Chinese Medicine Studies