MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification

Shadman Sobhan; Kazi Abrar Mahmud; Abduz Zami

arXiv:2506.21199·cs.CV·January 13, 2026

MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification

Shadman Sobhan, Kazi Abrar Mahmud, Abduz Zami

PDF

TL;DR

MedPrompt is a unified framework that combines large language models and modular CNNs to enable flexible, prompt-driven medical image analysis across multiple tasks and modalities, with high accuracy and efficiency.

Contribution

The paper introduces MedPrompt, a novel system that integrates LLM-based task planning with weight routing in CNNs, allowing scalable multi-task medical imaging without retraining.

Findings

01

Achieves 97% correctness in task interpretation and execution.

02

Demonstrates high segmentation accuracy (Dice 0.9856) and classification F1 (0.9744).

03

Operates with an average latency of 2.5 seconds, suitable for near real-time use.

Abstract

Current medical image analysis systems are typically task-specific, requiring separate models for classification and segmentation, and lack the flexibility to support user-defined workflows. To address these challenges, we introduce MedPrompt, a unified framework that combines a few-shot prompted Large Language Model (Llama-4-17B) for high-level task planning with a modular Convolutional Neural Network (DeepFusionLab) for low-level image processing. The LLM interprets user instructions and generates structured output to dynamically route task-specific pretrained weights. This weight routing approach avoids retraining the entire framework when adding new tasks-only task-specific weights are required, enhancing scalability and deployment. We evaluated MedPrompt across 19 public datasets, covering 12 tasks spanning 5 imaging modalities. The system achieves a 97% end-to-end correctness in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.