Check Your Facts and Try Again: Improving Large Language Models with   External Knowledge and Automated Feedback

Baolin Peng; Michel Galley; Pengcheng He; Hao Cheng; Yujia; Xie; Yu Hu; Qiuyuan Huang; Lars Liden; Zhou Yu; Weizhu Chen; and Jianfeng Gao

arXiv:2302.12813·cs.CL·March 10, 2023·147 cites

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback

Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia, Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, and Jianfeng Gao

PDF

Open Access

TL;DR

This paper introduces LLM-Augmenter, a system that enhances large language models by grounding responses in external knowledge and using automated feedback to reduce hallucinations while maintaining response quality.

Contribution

It presents a modular system that improves LLM factuality and response accuracy through external knowledge integration and iterative prompt refinement.

Findings

01

Significantly reduces hallucinations in ChatGPT responses

02

Maintains fluency and informativeness of generated responses

03

Validated on dialog and question answering tasks

Abstract

Large language models (LLMs), such as ChatGPT, are able to generate human-like, fluent responses for many downstream tasks, e.g., task-oriented dialog and question answering. However, applying LLMs to real-world, mission-critical applications remains challenging mainly due to their tendency to generate hallucinations and their inability to use external knowledge. This paper proposes a LLM-Augmenter system, which augments a black-box LLM with a set of plug-and-play modules. Our system makes the LLM generate responses grounded in external knowledge, e.g., stored in task-specific databases. It also iteratively revises LLM prompts to improve model responses using feedback generated by utility functions, e.g., the factuality score of a LLM-generated response. The effectiveness of LLM-Augmenter is empirically validated on two types of scenarios, task-oriented dialog and open-domain question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques