Towards Reliable and Fluent Large Language Models: Incorporating   Feedback Learning Loops in QA Systems

Dongyub Lee; Taesun Whang; Chanhee Lee; Heuiseok Lim

arXiv:2309.06384·cs.CL·September 13, 2023·2 cites

Towards Reliable and Fluent Large Language Models: Incorporating Feedback Learning Loops in QA Systems

Dongyub Lee, Taesun Whang, Chanhee Lee, Heuiseok Lim

PDF

Open Access

TL;DR

This paper introduces a feedback learning loop using a critic model to evaluate and improve large language models' responses in QA systems, significantly enhancing citation accuracy and fluency.

Contribution

It develops a critic-based feedback mechanism and iterative training process to improve LLM response quality in terms of citation, correctness, and fluency.

Findings

01

4% increase in citation precision

02

8% improvement in fluency (MAUVE) metric

03

Maintains high correctness levels

Abstract

Large language models (LLMs) have emerged as versatile tools in various daily applications. However, they are fraught with issues that undermine their utility and trustworthiness. These include the incorporation of erroneous references (citation), the generation of hallucinated information (correctness), and the inclusion of superfluous or omission of crucial details (fluency). To ameliorate these concerns, this study makes several key contributions. First, we build a dataset to train a critic model capable of evaluating the citation, correctness, and fluency of responses generated by LLMs in QA systems. Second, we propose an automated feedback mechanism that leverages the critic model to offer real-time feedback on heterogeneous aspects of generated text. Third, we introduce a feedback learning loop that uses this critic model to iteratively improve the performance of the LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research