LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models
Hieu Tran, Junda Wang, Yujan Ting, Weijing Huang, Terrence Chen

TL;DR
LEAF introduces a dual approach combining fact-checking and self-training to significantly improve the factual accuracy of large language models in knowledge-intensive tasks like medical question answering.
Contribution
It proposes a novel framework that integrates fact-checking into retrieval and training processes to enhance LLM factualness without altering core model parameters.
Findings
Fact-checked responses improve model accuracy
Fact-Check-Then-RAG enhances retrieval quality
Self-training with fact-checking boosts factual reliability
Abstract
Large language models (LLMs) have shown remarkable capabilities in various natural language processing tasks, yet they often struggle with maintaining factual accuracy, particularly in knowledge-intensive domains like healthcare. This study introduces LEAF: Learning and Evaluation Augmented by Fact-Checking, a novel approach designed to enhance the factual reliability of LLMs, with a focus on medical question answering (QA). LEAF utilizes a dual strategy to enhance the factual accuracy of responses from models such as Llama 3 70B Instruct and Llama 3 8B Instruct. The first strategy, Fact-Check-Then-RAG, improves Retrieval-Augmented Generation (RAG) by incorporating fact-checking results to guide the retrieval process without updating model parameters. The second strategy, Learning from Fact-Checks via Self-Training, involves supervised fine-tuning (SFT) on fact-checked responses or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Deception detection and forensic psychology
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Dense Connections · Layer Normalization · Residual Connection · Linear Warmup With Linear Decay · WordPiece · Adam
