TL;DR
MiniGPT-Pancreas is a multimodal large language model designed to assist clinicians in pancreas cancer diagnosis by integrating visual and textual data, showing promising results in detection and classification tasks.
Contribution
This work introduces MiniGPT-Pancreas, a fine-tuned multimodal model specifically adapted for pancreas cancer detection and classification using multimodal prompts and medical imaging datasets.
Findings
Achieved IoU of 0.595 for pancreas detection on NIH dataset.
Achieved 87.6% accuracy in pancreas cancer classification.
IoU of 0.168 for pancreas tumor detection on MSD dataset.
Abstract
Problem: Pancreas radiological imaging is challenging due to the small size, blurred boundaries, and variability of shape and position of the organ among patients. Goal: In this work we present MiniGPT-Pancreas, a Multimodal Large Language Model (MLLM), as an interactive chatbot to support clinicians in pancreas cancer diagnosis by integrating visual and textual information. Methods: MiniGPT-v2, a general-purpose MLLM, was fine-tuned in a cascaded way for pancreas detection, tumor classification, and tumor detection with multimodal prompts combining questions and computed tomography scans from the National Institute of Health (NIH), and Medical Segmentation Decathlon (MSD) datasets. The AbdomenCT-1k dataset was used to detect the liver, spleen, kidney, and pancreas. Results: MiniGPT-Pancreas achieved an Intersection over Union (IoU) of 0.595 and 0.550 for the detection of pancreas on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
