WoLF: Wide-scope Large Language Model Framework for CXR Understanding
Seil Kang, Donghyun Kim, Junhyeok Kim, Hyo Kyung Lee, Seong Jae Hwang

TL;DR
WoLF is a framework for chest X-ray (CXR) understanding that integrates multi-faceted patient records, anatomy-based report structuring, and AI-based evaluation; it outperforms existing models on VQA and report generation metrics.
Contribution
The paper introduces WoLF, a novel large language model framework that incorporates multi-source patient data, anatomical report structuring, and specialized evaluation for improved CXR understanding.
Findings
Superior VQA performance on the MIMIC-CXR dataset (+9.47%p)
Enhanced report generation metrics (+7.3%p BLEU-1)
Effective integration of electronic health records and anatomical report structuring
Abstract
Significant methodological strides have been made toward Chest X-ray (CXR) understanding via modern vision-language models (VLMs), demonstrating impressive Visual Question Answering (VQA) and CXR report generation abilities. However, existing CXR understanding frameworks still possess several procedural caveats. (1) Previous methods solely use CXR reports, which are insufficient for comprehensive VQA, especially when additional health-related data like medication history and prior diagnoses are needed. (2) Previous methods use raw CXR reports, which are often arbitrarily structured. While modern language models can understand various text formats, restructuring reports into clearer, organized, anatomy-based information could enhance their usefulness. (3) Current evaluation methods for CXR-VQA primarily emphasize linguistic correctness, lacking the capability to…
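As a rough illustration of point (2), the sketch below groups the sentences of a free-text report under anatomical headings by simple keyword matching. It is not WoLF's actual procedure, which this page does not describe; the region names, the keyword lists, the function name structure_report, and the sample report are all hypothetical and chosen only to show what "anatomy-based" restructuring could look like.

```python
import re
from collections import defaultdict
from typing import Dict, List

# Hypothetical region -> keyword map (an assumption, not taken from the paper).
REGION_KEYWORDS = {
    "lungs": ("lung", "pulmonary", "consolidation", "atelectasis"),
    "pleura": ("pleural", "pneumothorax", "effusion"),
    "heart": ("heart", "cardiac", "cardiomegaly", "cardiomediastinal"),
    "bones": ("rib", "osseous", "fracture", "spine"),
}


def structure_report(report: str) -> Dict[str, List[str]]:
    """Group sentences under the first region whose keyword starts a word in them."""
    sections = defaultdict(list)
    # Naive sentence split on whitespace that follows ., ! or ?
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", report) if s.strip()]
    for sentence in sentences:
        lowered = sentence.lower()
        region = next(
            (
                name
                for name, words in REGION_KEYWORDS.items()
                # \b anchors the keyword at a word start, so "rib" matches
                # "ribs" but not "described".
                if any(re.search(r"\b" + re.escape(w), lowered) for w in words)
            ),
            "other",  # sentences with no matching keyword
        )
        sections[region].append(sentence)
    return dict(sections)


# Made-up sample report, for illustration only.
example = (
    "The lungs are clear. No pleural effusion or pneumothorax. "
    "Heart size is normal. No acute osseous abnormality."
)
structured = structure_report(example)
for region, sentences in structured.items():
    print(region + ": " + " ".join(sentences))
```

A keyword lookup like this only shows the target format; a sentence that mentions two regions is filed under the first match, which a real restructuring step would need to handle more carefully.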
Taxonomy
Topics: Advanced Computational Techniques and Applications · Advanced Data Processing Techniques · Multi-Agent Systems and Negotiation
