Evo-MedAgent: Beyond One-Shot Diagnosis with Agents That Remember, Reflect, and Improve

Weixiang Shen; Bailiang Jian; Jun Li; Che Liu; Johannes Moll; Xiaobin Hu; Daniel Rueckert; Hongwei Bran Li; Jiazhen Pan

arXiv:2604.14475·cs.AI·April 17, 2026

Evo-MedAgent: Beyond One-Shot Diagnosis with Agents That Remember, Reflect, and Improve

Weixiang Shen, Bailiang Jian, Jun Li, Che Liu, Johannes Moll, Xiaobin Hu, Daniel Rueckert, Hongwei Bran Li, Jiazhen Pan

PDF

TL;DR

Evo-MedAgent introduces a self-evolving memory system for medical diagnosis agents, enabling inter-case learning and reflection to improve accuracy without additional training.

Contribution

The paper presents a novel memory module that allows medical agents to learn from past cases and refine their diagnostic heuristics at test time.

Findings

01

Evo-MedAgent improves MCQ accuracy from 0.68 to 0.79 on GPT-5-mini.

02

It enhances diagnostic performance more than external tool orchestration.

03

No additional training is required, with minimal per-case overhead.

Abstract

Tool-augmented large language model (LLM) agents can orchestrate specialist classifiers, segmentation models, and visual question-answering modules to interpret chest X-rays. However, these agents still solve each case in isolation: they fail to accumulate experience across cases, correct recurrent reasoning mistakes, or adapt their tool-use behavior without expensive reinforcement learning. While a radiologist naturally improves with every case, current agents remain static. In this work, we propose Evo-MedAgent, a self-evolving memory module that equips a medical agent with the capacity for inter-case learning at test time. Our memory comprises three complementary stores: (1)~\emph{Retrospective Clinical Episodes} that retrieve problem-solving experiences from similar past cases, (2)~an \emph{Adaptive Procedural Heuristics} bank curating priority-tagged diagnostic rules that evolves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.