Reviewing Clinical Knowledge in Medical Large Language Models: Training and Beyond

Qiyuan Li; Haijiang Liu; Caicai Guo; Chao Gao; Deyu Chen; Meng Wang; Feng Gao; Frank van Harmelen; Jinguang Gu

arXiv:2502.20988·cs.AI·August 12, 2025

Reviewing Clinical Knowledge in Medical Large Language Models: Training and Beyond

Qiyuan Li, Haijiang Liu, Caicai Guo, Chao Gao, Deyu Chen, Meng Wang, Feng Gao, Frank van Harmelen, Jinguang Gu

PDF

TL;DR

This paper reviews methods for integrating clinical medical knowledge into large language models, including training, knowledge graphs, and retrieval techniques, highlighting current practices, challenges, and industrial applications.

Contribution

It provides a comprehensive survey of current approaches to embed clinical knowledge into medical LLMs, emphasizing diversity and real-world relevance.

Findings

01

Knowledge integration improves model accuracy and traceability.

02

Knowledge graphs and retrieval methods are key to enhancing medical LLMs.

03

Industrial applications reveal gaps between research and practice.

Abstract

The large-scale development of large language models (LLMs) in medical contexts, such as diagnostic assistance and treatment recommendations, necessitates that these models possess accurate medical knowledge and deliver traceable decision-making processes. Clinical knowledge, encompassing the insights gained from research on the causes, prognosis, diagnosis, and treatment of diseases, has been extensively examined within real-world medical practices. Recently, there has been a notable increase in research efforts aimed at integrating this type of knowledge into LLMs, encompassing not only traditional text and multimodal data integration but also technologies such as knowledge graphs (KGs) and retrieval-augmented generation (RAG). In this paper, we review the various initiatives to embed clinical knowledge into training-based, KG-supported, and RAG-assisted LLMs. We begin by gathering…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.