Learning to Verify Summary Facts with Fine-Grained LLM Feedback

Jihwan Oh; Jeonghwan Choi; Nicole Hee-Yeon Kim; Taewon Yun; Hwanjun; Song

arXiv:2412.10689·cs.CL·December 17, 2024

Learning to Verify Summary Facts with Fine-Grained LLM Feedback

Jihwan Oh, Jeonghwan Choi, Nicole Hee-Yeon Kim, Taewon Yun, Hwanjun, Song

PDF

Open Access 1 Repo

TL;DR

This paper presents FineSumFact, a large-scale dataset of fine-grained factual feedback on summaries generated by multiple LLMs, used to fine-tune a lightweight model that outperforms human-annotated data in fact verification tasks.

Contribution

Introduces FineSumFact, a novel dataset leveraging LLM-generated feedback for training summary fact verifiers, improving efficiency and performance over human-labeled data.

Findings

01

Model trained on LLM-generated data outperforms human-labeled data-based models.

02

Fine-tuning with LLM feedback is more cost-effective than human feedback.

03

The dataset enables high-performance fact verification with lightweight models.

Abstract

Training automatic summary fact verifiers often faces the challenge of a lack of human-labeled data. In this paper, we explore alternative way of leveraging Large Language Model (LLM) generated feedback to address the inherent limitation of using human-labeled data. We introduce FineSumFact, a large-scale dataset containing fine-grained factual feedback on summaries. We employ 10 distinct LLMs for diverse summary generation and Llama-3-70B-Instruct for feedback. We utilize this dataset to fine-tune the lightweight open-source model Llama-3-8B-Instruct, optimizing resource efficiency while maintaining high performance. Our experimental results reveal that the model trained on extensive LLM-generated datasets surpasses that trained on smaller human-annotated datasets when evaluated using human-generated test sets. Fine-tuning fact verification models with LLM feedback can be more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

disl-lab/finesumfact
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling · Forecasting Techniques and Applications · Stock Market Forecasting Methods