ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents

Yusheng Liao; Shuyang Jiang; Yanfeng Wang; Yu Wang

arXiv:2410.17657·cs.CL·June 17, 2025

ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents

Yusheng Liao, Shuyang Jiang, Yanfeng Wang, Yu Wang

PDF

Open Access 3 Repos 1 Video

TL;DR

ReflecTool is a reflection-aware framework that enhances clinical agents by utilizing domain-specific tools and long-term memory, significantly improving performance across diverse medical tasks compared to existing methods.

Contribution

The paper introduces ReflecTool, a novel framework that leverages long-term memory and reflection to improve tool usage in clinical agents, along with a comprehensive benchmark for evaluation.

Findings

01

ReflecTool outperforms pure LLMs by over 10 points on the ClinicalAgent Benchmark.

02

ReflecTool surpasses existing agent-based methods by 3 points.

03

Extensive experiments demonstrate its adaptability and effectiveness in complex clinical tasks.

Abstract

Large Language Models (LLMs) have shown promising potential in the medical domain, assisting with tasks like clinical note generation and patient communication. However, current LLMs are limited to text-based communication, hindering their ability to interact with diverse forms of information in clinical environments. Despite clinical agents succeeding in diverse signal interaction, they are oriented to a single clinical scenario and hence fail for broader applications. To evaluate clinical agents holistically, we propose ClinicalAgent Bench~(CAB), a comprehensive medical agent benchmark consisting of 18 tasks across five key realistic clinical dimensions. Building on this, we introduce ReflecTool, a novel framework that excels at utilizing domain-specific tools within two stages. The first optimization stage progressively enlarges a long-term memory by saving successful solving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents· underline

Taxonomy

TopicsBusiness Process Modeling and Analysis · Scientific Computing and Data Management