ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning

Juncheng Wu; Letian Zhang; Yuhan Wang; Haoqin Tu; Hardy Chen; Zijun Wang; Cihang Xie; Yuyin Zhou

arXiv:2605.20176·cs.CL·May 20, 2026

TL;DR

ClinSeekAgent is an automated framework that actively seeks, refines, and synthesizes multimodal evidence from heterogeneous sources to enhance clinical decision-making with large language models.

Contribution

The paper introduces an agentic system for dynamic evidence seeking in clinical workflows. The system improves decision accuracy and serves as both an inference pipeline and a training pipeline.

Findings

01 Improves F1 on text-only EHR tasks by up to 3.2 points.

02 Improves F1 on multimodal tasks by 15.1 points.

03 Distills evidence-seeking trajectories into a high-performing open-source model.

Abstract

Large language models (LLMs) and agentic systems have shown promise for clinical decision support, but existing works largely assume that evidence has already been curated and handed to the model. Real-world clinical workflows instead require agents to actively seek, iteratively plan, and synthesize multimodal evidence from heterogeneous sources. In this paper, we introduce ClinSeekAgent, an automated agentic framework for dynamic multimodal evidence seeking that shifts the paradigm from passive evidence consumption to active evidence acquisition. Given only a clinical query and access to raw data sources, ClinSeekAgent gathers evidence by querying medical knowledge bases, navigating raw EHRs, and invoking medical imaging tools; refines its hypotheses as new information emerges; and integrates the collected evidence into grounded clinical decisions. ClinSeekAgent serves both as an…
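The seek, refine, and integrate cycle described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the tool functions, the `AgentState` and `Evidence` types, the planner, and the keyword-based refinement rule are all hypothetical stand-ins, and a real system would presumably let an LLM choose tools and revise hypotheses.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Evidence:
    source: str   # name of the tool that produced this item
    content: str  # raw text returned by the tool


@dataclass
class AgentState:
    query: str
    hypothesis: str = "undetermined"
    evidence: List[Evidence] = field(default_factory=list)


# Hypothetical stand-ins for the three source types named in the abstract.
# Each returns canned text so the sketch runs without external services.
def search_knowledge_base(query: str) -> str:
    return "KB: elevated troponin suggests myocardial injury"


def read_ehr(query: str) -> str:
    return "EHR: troponin 2.1 ng/mL, chest pain on admission"


def analyze_image(query: str) -> str:
    return "IMG: no acute cardiopulmonary findings on chest X-ray"


TOOLS: Dict[str, Callable[[str], str]] = {
    "knowledge_base": search_knowledge_base,
    "ehr": read_ehr,
    "imaging": analyze_image,
}


def plan_next_tool(state: AgentState) -> Optional[str]:
    """Toy planner (assumption): pick the first tool not yet used."""
    used = {item.source for item in state.evidence}
    for name in TOOLS:
        if name not in used:
            return name
    return None  # every source has been consulted


def refine_hypothesis(state: AgentState) -> str:
    """Toy refinement (assumption): a keyword rule in place of LLM reasoning."""
    text = " ".join(item.content for item in state.evidence).lower()
    return "possible myocardial injury" if "troponin" in text else "undetermined"


def seek_evidence(query: str, max_steps: int = 5) -> AgentState:
    """Gather evidence one tool call at a time, refining after each call."""
    state = AgentState(query=query)
    for _ in range(max_steps):
        tool = plan_next_tool(state)
        if tool is None:
            break
        state.evidence.append(Evidence(tool, TOOLS[tool](query)))
        state.hypothesis = refine_hypothesis(state)
    return state


if __name__ == "__main__":
    final = seek_evidence("Does this patient show signs of cardiac injury?")
    for item in final.evidence:
        print(f"[{item.source}] {item.content}")
    print("Hypothesis:", final.hypothesis)
```

The loop starts with only a query and a set of callable sources, which mirrors the shift from passive evidence consumption to active evidence acquisition; the `max_steps` bound is an assumed safeguard against unbounded tool use.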

Code & Models

Repositories

ucsc-vlaa/ClinSeekAgent (GitHub)

Models

UCSC-VLAA/ClinSeek-35B-A3B (Hugging Face model, 76 downloads)

Datasets

UCSC-VLAA/ClinSeek-Bench (dataset, 107 downloads)
