Citation-Enhanced Generation for LLM-based Chatbots
Weitao Li, Junkai Li, Weizhi Ma, Yang Liu

TL;DR
This paper introduces a post-hoc Citation-Enhanced Generation method for LLM chatbots that reduces hallucinations by verifying and regenerating responses with supporting citations, without additional training.
Contribution
It presents a training-free, plug-and-play retrieval and inference-based approach to mitigate hallucinations in LLM chatbots post-generation.
Findings
Outperforms state-of-the-art hallucination detection methods
Effective in regenerating responses with supported citations
Works across various LLMs and benchmarks
Abstract
Large language models (LLMs) exhibit powerful general intelligence across diverse scenarios, including their integration into chatbots. However, a vital challenge of LLM-based chatbots is that they may produce hallucinated content in responses, which significantly limits their applicability. Various efforts have been made to alleviate hallucination, such as retrieval augmented generation and reinforcement learning with human feedback, but most of them require additional training and data annotation. In this paper, we propose a novel post-hoc Citation-Enhanced Generation (CEG) approach combined with retrieval argumentation. Unlike previous studies that focus on preventing hallucinations during generation, our method addresses this issue in a post-hoc way. It incorporates a retrieval module to search for supporting documents relevant to the generated content, and employs a natural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling
MethodsFocus
