TL;DR
MedRoute introduces a dynamic multi-agent framework with reinforcement learning to emulate clinical specialist collaboration, improving diagnostic accuracy over static approaches in multimodal medical diagnosis.
Contribution
This work presents MedRoute, a flexible multi-agent system with RL-based specialist routing and a moderator, closely mirroring real clinical workflows.
Findings
Outperforms state-of-the-art baselines in diagnostic accuracy
Demonstrates effectiveness on text and image-based medical datasets
Validates the adaptability of dynamic specialist selection
Abstract
Medical diagnosis using Large Multimodal Models (LMMs) has gained increasing attention due to capability of these models in providing precise diagnoses. These models generally combine medical questions with visual inputs to generate diagnoses or treatments. However, they are often overly general and unsuitable under the wide range of medical conditions in real-world healthcare. In clinical practice, diagnosis is performed by multiple specialists, each contributing domain-specific expertise. To emulate this process, a potential solution is to deploy a dynamic multi-agent LMM framework, where each agent functions as a medical specialist. Current approaches in this emerging area, typically relying on static or predefined selection of various specialists, cannot be adapted to the changing practical scenario. In this paper, we propose MedRoute, a flexible and dynamic multi-agent framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
