# A Review of Multi-Agent AI Systems for Biological and Clinical Data Analysis

**Authors:** Jackson Spieser, Ali Balapour, Jarek Meller, Krushna C. Patra, Behrouz Shamsaei

PMC · DOI: 10.3390/mps9020033 · 2026-02-28

## TL;DR

This paper reviews how multi-agent AI systems can improve biomedical data analysis by working together more effectively than single AI models.

## Contribution

The paper introduces a novel analysis of multi-agent systems' performance and challenges in biomedical and clinical data analysis.

## Key findings

- MAS architectures improved oncology decision-making accuracy from 30.3% to 87.2%.
- MAS reached 93.2% accuracy on USMLE-style benchmarks through simulated clinical evolution.
- MAS frameworks enhanced clinical trial matching accuracy to 87.3% and improved clinician screening efficiency by 42.6%.

## Abstract

This review evaluates the emerging paradigm of multi-agent systems (MASs) for biomedical and clinical data analysis, focusing on their ability to overcome the reasoning and reliability limitations of standalone large language models (LLMs). We synthesize findings from recent architectural frameworks, specifically LangGraph, CrewAI, and the Model Context Protocol (MCP), to examine how specialized agent teams divide labor, utilize precision tools, and cross-verify outputs. We find that MAS architectures yield significant performance gains in various domains: recent implementations improved oncology decision-making accuracy from 30.3% to 87.2% and reached a peak of 93.2% accuracy on USMLE-style benchmarks through simulated clinical evolution. In clinical trial matching, multi-agent frameworks achieved 87.3% accuracy and enhanced clinician screening efficiency by 42.6% (p < 0.001). However, we also highlight critical operational challenges, including an unreliability tax of 15–50× higher token consumption compared to standalone models and the risk of cascading errors where initial hallucinations are amplified across the agent collective. We conclude that while MAS enables a shift toward collaborative intelligence in biomedicine, its clinical and research adoption requires the development of deterministic orchestration and rigorous cost-utility frameworks to ensure safety and expert-centered oversight.

## Full-text entities

- **Genes:** MRGPRX1 (MAS related GPR family member X1) [NCBI Gene 259249] {aka GPCR, MGRG2, MRGX1, SNSR4}, BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}
- **Diseases:** cardiac, and genetic disorders (MESH:D006331), Cancer (MESH:D009369), contact dermatitis (MESH:D003877), symptom X (MESH:D012816), MSI (MESH:D053842), injury to (MESH:D014947), hallucination (MESH:D006212), instability (MESH:D043171), common cold (MESH:D003139), LLMs (MESH:D007806), MAS (MESH:D015161), rheumatic heart disease (MESH:D012214), XAI (MESH:C538243)
- **Chemicals:** LLM (-)
- **Species:** Miconchus sp. AS (species) [taxon 312913], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A2A

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13010680/full.md

---
Source: https://tomesphere.com/paper/PMC13010680