TL;DR
PASS introduces a probabilistic, interpretable, and adaptive multimodal framework for chest X-ray reasoning, addressing trust, safety, and efficiency issues in medical AI by dynamically selecting tools and providing decision path probabilities.
Contribution
It is the first framework to integrate probabilistic, interpretable, and adaptive reasoning in multimodal medical AI, specifically for chest X-ray analysis, with a novel training procedure and benchmark.
Findings
Outperforms baselines in accuracy and safety metrics
Balances performance with computational efficiency
Provides interpretable decision trajectories
Abstract
Existing tool-augmented agentic systems are limited in the real world by (i) black-box reasoning steps that undermine trust of decision-making and pose safety risks, (ii) poor multimodal integration, which is inherently critical for healthcare tasks, and (iii) rigid and computationally inefficient agentic pipelines. We introduce PASS (Probabilistic Agentic Supernet Sampling), the first multimodal framework to address these challenges in the context of Chest X-Ray (CXR) reasoning. PASS adaptively samples agentic workflows over a multi-tool graph, yielding decision paths annotated with interpretable probabilities. Given the complex CXR reasoning task with multimodal medical data, PASS leverages its learned task-conditioned distribution over the agentic supernet. Thus, it adaptively selects the most suitable tool at each supernet layer, offering probability-annotated trajectories for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
