ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning
Juncheng Wu, Letian Zhang, Yuhan Wang, Haoqin Tu, Hardy Chen, Zijun Wang, Cihang Xie, Yuyin Zhou

TL;DR
ClinSeekAgent is an automated framework that actively seeks, refines, and synthesizes multimodal evidence from heterogeneous sources to enhance clinical decision-making with large language models.
Contribution
It introduces a novel agentic system for dynamic evidence seeking in clinical workflows, improving decision accuracy and serving as both inference and training pipeline.
Findings
Improves F1 scores on text-only EHR tasks by up to 3.2 points.
Enhances multimodal task performance with a 15.1 point increase in F1.
Distilled evidence-seeking trajectories into a high-performing open-source model.
Abstract
Large language models (LLMs) and agentic systems have shown promise for clinical decision support, but existing works largely assume that evidence has already been curated and handed to the model. Real-world clinical workflows instead require agents to actively seek, iteratively plan, and synthesize multimodal evidence from heterogeneous sources. In this paper, we introduce ClinSeekAgent, an automated agentic framework for dynamic multimodal evidence seeking that shifts the paradigm from passive evidence consumption to active evidence acquisition. Given only a clinical query and access to raw data sources, ClinSeekAgent gathers evidence by querying medical knowledge bases, navigating raw EHRs, and invoking medical imaging tools; refines its hypotheses as new information emerges; and integrates the collected evidence into grounded clinical decisions. ClinSeekAgent serves both as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
