Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry
Junu Kim, Chaeeun Shim, Sungjin Park, Su Yeon Lee, Gee Young Suh,, Chae-Man Lim, Seong Jin Choi, Song Mi Moon, Kyoung-Ho Song, Eu Suk Kim, Hong, Bin Kim, Sejoong Kim, Chami Im, Dong-Wan Kang, Yong Soo Kim, Hee-Joon Bae,, Sung Yoon Lim, Han-Gil Jeong, Edward Choi

TL;DR
This paper improves large language models' clinical reasoning by fine-tuning them with real-world sepsis registry data, leading to better performance across diverse clinical tasks and datasets.
Contribution
It introduces a method to enhance LLMs' clinical reasoning using real-world data and reinforcement learning, creating a model called C-Reason with improved capabilities.
Findings
C-Reason outperforms baseline models on clinical reasoning tasks.
The model generalizes well across different datasets and diseases.
Expert evaluations confirm improved reasoning quality.
Abstract
Although large language models (LLMs) have demonstrated impressive reasoning capabilities across general domains, their effectiveness in real-world clinical practice remains limited. This is likely due to their insufficient exposure to real-world clinical data during training, as such data is typically not included due to privacy concerns. To address this, we propose enhancing the clinical reasoning capabilities of LLMs by leveraging real-world clinical data. We constructed reasoning-intensive questions from a nationwide sepsis registry and fine-tuned Phi-4 on these questions using reinforcement learning, resulting in C-Reason. C-Reason exhibited strong clinical reasoning capabilities on the in-domain test set, as evidenced by both quantitative metrics and expert evaluations. Furthermore, its enhanced reasoning capabilities generalized to a sepsis dataset involving different tasks and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
MethodsFocus
