CureAgent: A Training-Free Executor-Analyst Framework for Clinical Reasoning
Ting-Ting Xie, Yixin Zhang

TL;DR
CureAgent introduces a modular, training-free framework combining specialized executors and long-context models to improve clinical reasoning, addressing context utilization failure and scaling challenges in biomedical AI.
Contribution
The paper proposes a novel Executor-Analyst modular architecture with stratified ensemble strategies, achieving state-of-the-art performance without finetuning.
Findings
Mitigates reasoning deficits in biomedical LLMs.
Demonstrates the effectiveness of stratified ensemble over global pooling.
Identifies critical scaling insights like context-performance paradox.
Abstract
Current clinical agent built on small LLMs, such as TxAgent suffer from a \textit{Context Utilization Failure}, where models successfully retrieve biomedical evidence due to supervised finetuning but fail to ground their diagnosis in that information. In this work, we propose the Executor-Analyst Framework, a modular architecture that decouples the syntactic precision of tool execution from the semantic robustness of clinical reasoning. By orchestrating specialized TxAgents (Executors) with long-context foundation models (Analysts), we mitigate the reasoning deficits observed in monolithic models. Beyond simple modularity, we demonstrate that a Stratified Ensemble strategy significantly outperforms global pooling by preserving evidentiary diversity, effectively addressing the information bottleneck. Furthermore, our stress tests reveal critical scaling insights: (1) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
