Attribute Structuring Improves LLM-Based Evaluation of Clinical Text Summaries
Zelalem Gero, Chandan Singh, Yiqing Xie, Sheng Zhang, Praveen, Subramanian, Paul Vozila, Tristan Naumann, Jianfeng Gao, Hoifung Poon

TL;DR
This paper introduces Attribute Structuring (AS), a framework that improves the evaluation of clinical text summaries by decomposing the process, leading to better alignment with human judgments and enabling efficient auditing.
Contribution
The paper proposes Attribute Structuring, a novel evaluation framework that enhances the accuracy and interpretability of LLM-based clinical summary assessments.
Findings
AS improves correlation with human annotations
AS provides interpretable evaluation spans
Framework enhances trustworthy clinical summary evaluation
Abstract
Summarizing clinical text is crucial in health decision-support and clinical research. Large language models (LLMs) have shown the potential to generate accurate clinical text summaries, but still struggle with issues regarding grounding and evaluation, especially in safety-critical domains such as health. Holistically evaluating text summaries is challenging because they may contain unsubstantiated information. Here, we explore a general mitigation framework using Attribute Structuring (AS), which structures the summary evaluation process. It decomposes the evaluation process into a grounded procedure that uses an LLM for relatively simple structuring and scoring tasks, rather than the full task of holistic summary evaluation. Experiments show that AS consistently improves the correspondence between human annotations and automated metrics in clinical text summarization. Additionally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
