WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis

Xinheng Lyu; Yuci Liang; Wenting Chen; Meidan Ding; Jiaqi Yang; Guolin Huang; Daokun Zhang; Xiangjian He; and Linlin Shen

arXiv:2507.14680·cs.CV·July 22, 2025

WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis

Xinheng Lyu, Yuci Liang, Wenting Chen, Meidan Ding, Jiaqi Yang, Guolin Huang, Daokun Zhang, Xiangjian He, and Linlin Shen

PDF

TL;DR

WSI-Agents introduces a collaborative multi-agent system that enhances multi-modal whole slide image analysis by improving accuracy and versatility through specialized agents, task allocation, and verification mechanisms, outperforming existing models.

Contribution

The paper presents a novel multi-agent framework for WSI analysis that combines specialized agents, task allocation, and verification to improve performance and multi-task capabilities.

Findings

01

WSI-Agents outperforms current WSI MLLMs and medical agent frameworks.

02

The system effectively balances accuracy and versatility in multi-modal WSI tasks.

03

Extensive experiments demonstrate superior performance across diverse benchmarks.

Abstract

Whole slide images (WSIs) are vital in digital pathology, enabling gigapixel tissue analysis across various pathological tasks. While recent advancements in multi-modal large language models (MLLMs) allow multi-task WSI analysis through natural language, they often underperform compared to task-specific models. Collaborative multi-agent systems have emerged as a promising solution to balance versatility and accuracy in healthcare, yet their potential remains underexplored in pathology-specific domains. To address these issues, we propose WSI-Agents, a novel collaborative multi-agent system for multi-modal WSI analysis. WSI-Agents integrates specialized functional agents with robust task allocation and verification mechanisms to enhance both task-specific accuracy and multi-task versatility through three components: (1) a task allocation module assigning tasks to expert agents using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.