Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index

Praveenkumar Katwe; Rakesh Chandra; Balabantaray Kali; Prasad Vittala

arXiv:2507.22744·cs.CL·July 31, 2025

Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index

Praveenkumar Katwe, Rakesh Chandra, Balabantaray Kali, Prasad Vittala

PDF

TL;DR

This paper presents a reinforcement learning-based fine-tuning method that improves the factual accuracy of summaries by reducing entity hallucinations, using an automatic entity grounding metric called EHI, without human annotations.

Contribution

It introduces a scalable, reward-driven fine-tuning framework that explicitly optimizes for entity grounding in summarization models, reducing hallucinations effectively.

Findings

01

Significant reduction in entity hallucinations across datasets.

02

Improved entity correctness without sacrificing fluency.

03

Reproducible pipeline released for further research.

Abstract

Reducing hallucinations in abstractive summarization remains a critical challenge for deploying language models (LMs) in real-world settings. In this work, we introduce a rewarddriven fine-tuning framework that explicitly optimizes for Entity Hallucination Index (EHI), a metric designed to quantify the presence, correctness, and grounding of named entities in generated summaries. Given a corpus of meeting transcripts, we first generate baseline summaries using a pre-trained LM and compute EHI scores via automatic entity extraction and matching. We then apply reinforcement learning to fine-tune the model parameters, using EHI as a reward signal to bias generation toward entity-faithful outputs. Our approach does not rely on human-written factuality annotations, enabling scalable fine-tuning. Experiments demonstrate consistent improvements in EHI across datasets, with qualitative analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.