ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents
Yusheng Liao, Shuyang Jiang, Yanfeng Wang, Yu Wang

TL;DR
ReflecTool is a reflection-aware framework that enhances clinical agents by utilizing domain-specific tools and long-term memory, significantly improving performance across diverse medical tasks compared to existing methods.
Contribution
The paper introduces ReflecTool, a novel framework that leverages long-term memory and reflection to improve tool usage in clinical agents, along with a comprehensive benchmark for evaluation.
Findings
ReflecTool outperforms pure LLMs by over 10 points on the ClinicalAgent Benchmark.
ReflecTool surpasses existing agent-based methods by 3 points.
Extensive experiments demonstrate its adaptability and effectiveness in complex clinical tasks.
Abstract
Large Language Models (LLMs) have shown promising potential in the medical domain, assisting with tasks like clinical note generation and patient communication. However, current LLMs are limited to text-based communication, hindering their ability to interact with diverse forms of information in clinical environments. Despite clinical agents succeeding in diverse signal interaction, they are oriented to a single clinical scenario and hence fail for broader applications. To evaluate clinical agents holistically, we propose ClinicalAgent Bench~(CAB), a comprehensive medical agent benchmark consisting of 18 tasks across five key realistic clinical dimensions. Building on this, we introduce ReflecTool, a novel framework that excels at utilizing domain-specific tools within two stages. The first optimization stage progressively enlarges a long-term memory by saving successful solving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsBusiness Process Modeling and Analysis · Scientific Computing and Data Management
