MedVerse: Efficient and Reliable Medical Reasoning via DAG-Structured Parallel Execution
Jianwen Chen, Xinyu Yang, Peng Xia, Arian Azarang, Yueh Z Lee, Gang Li, Hongtu Zhu, Yun Li, Beidi Chen, Huaxiu Yao

TL;DR
MedVerse introduces a DAG-structured parallel reasoning framework for medical inference, enhancing efficiency and reliability of large language models in complex medical tasks.
Contribution
It reformulates medical reasoning as a DAG process, incorporating a new data pipeline, topology-aware attention, and a parallel inference engine, with code available online.
Findings
Up to 8.9% performance improvement over strong LLMs.
1.3x reduction in inference latency compared to specialized medical LLMs.
1.7x increase in generation throughput due to parallel decoding.
Abstract
Large language models (LLMs) have demonstrated strong performance and rapid progress in a wide range of medical reasoning tasks. However, their sequential autoregressive decoding forces inherently parallel clinical reasoning, such as differential diagnosis, into a single linear reasoning path, limiting both efficiency and reliability for complex medical problems. To address this, we propose MedVerse, a reasoning framework for complex medical inference that reformulates medical reasoning as a parallelizable directed acyclic graph (DAG) process based on Petri net theory. The framework adopts a full-stack design across data, model architecture, and system execution. For data creation, we introduce the MedVerse Curator, an automated pipeline that synthesizes knowledge-grounded medical reasoning paths and transforms them into Petri net-structured representations. At the architectural level,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
