Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications

Jiawei He; Boya Zhang; Hossein Rouhizadeh; Yingjian Chen; Rui Yang; Jin Lu; Xudong Chen; Nan Liu; Douglas Teodoro

arXiv:2505.01146·q-bio.OT·December 19, 2025

Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications

Jiawei He, Boya Zhang, Hossein Rouhizadeh, Yingjian Chen, Rui Yang, Jin Lu, Xudong Chen, Nan Liu, Douglas Teodoro

PDF

TL;DR

This survey reviews retrieval-augmented generation (RAG) in biomedicine, discussing its architectures, challenges like latency and privacy, and future directions for clinical applications and multimodal integration.

Contribution

It provides a comprehensive classification, formalizes the biomedical RAG trilemma, and analyzes recent advancements and challenges in deploying RAG for clinical use.

Findings

01

Classifies biomedical RAG systems into naive, advanced, and modular paradigms.

02

Identifies trade-offs between reasoning, latency, and privacy in RAG deployment.

03

Highlights the potential and risks of agentic workflows in diagnostics.

Abstract

Large language models (LLMs) in biomedicine face a fundamental conflict between static parameter knowledge and the dynamic nature of clinical evidence. Retrieval-Augmented Generation (RAG) addresses this by grounding generation in external data, yet it introduces new complexities in latency and architecture. This survey synthesizes the biomedical RAG landscape (2020-2025), classifying systems into naive, advanced, and modular paradigms. Beyond a technological taxonomy, we formalize the biomedical RAG trilemma, identifying the inherent trade-offs between reasoning depth, inference latency, and data privacy that constrain current clinical deployment. We analyze how recent agentic workflows enhance diagnostic reasoning but risk prohibitive latency, and how privacy constraints dictate the choice between powerful cloud-based models and local deployment. Finally, we outline the alignment gap…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout · Softmax · Residual Connection · WordPiece