Beyond N-grams: A Hierarchical Reward Learning Framework for Clinically-Aware Medical Report Generation
Yuan Wang, Shujian Gao, Jiaxiang Liu, Songtao Jiang, Haoxiang Xia, Xiaotian Zhang, Zhaolu Kang, Yemin Wang, Zuozhu Liu

TL;DR
This paper introduces HiMed-RL, a hierarchical reward learning framework for medical report generation that enhances factual accuracy and diagnostic consistency, significantly reducing clinical hallucinations and improving trustworthiness.
Contribution
It proposes a novel hierarchical reward system with dynamic adjustment, explicitly optimizing linguistic fluency, factual grounding, and diagnostic consistency in medical report generation.
Findings
Achieves state-of-the-art performance on multiple benchmarks
Improves out-of-domain report quality by 12.1%
Reduces clinical hallucinations and factual errors
Abstract
Automatic medical report generation can greatly reduce the workload of doctors, but it is often unreliable for real-world deployment. Current methods can write formally fluent sentences but may be factually flawed, introducing serious medical errors known as clinical hallucinations, which make them untrustworthy for diagnosis. To bridge this gap, we introduce HiMed-RL, a Hierarchical Medical Reward Learning Framework designed to explicitly prioritize clinical quality. HiMed-RL moves beyond simple text matching by deconstructing reward learning into three synergistic levels: it first ensures linguistic fluency at the token-level, then enforces factual grounding at the concept-level by aligning key medical terms with expert knowledge, and finally assesses high-level diagnostic consistency at the semantic-level using a specialized LLM verifier. This hierarchical reward is implemented via a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Biomedical Text Mining and Ontologies
