Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring
Tiantian Zhang, Manxi Lin, Hongda Guo, Xiaofan Zhang, Ka Fung Peter, Chiu, Aasa Feragen, Qi Dou

TL;DR
This paper presents a novel multi-modal large language model approach that incorporates clinical guidelines into prostate cancer MRI scoring, improving accuracy without additional annotations or network complexity.
Contribution
It introduces a two-stage fine-tuning process to adapt a multi-modal large language model for PI-RADS scoring by integrating clinical guidelines directly into the model.
Findings
Improved scoring accuracy on public and in-house datasets.
Effective integration of clinical guidelines without extra annotations.
Enhanced model performance through feature distillation.
Abstract
The Prostate Imaging Reporting and Data System (PI-RADS) is pivotal in the diagnosis of clinically significant prostate cancer through MRI imaging. Current deep learning-based PI-RADS scoring methods often lack the incorporation of common PI-RADS clinical guideline~(PICG) utilized by radiologists, potentially compromising scoring accuracy. This paper introduces a novel approach that adapts a multi-modal large language model (MLLM) to incorporate PICG into PI-RADS scoring model without additional annotations and network parameters. We present a designed two-stage fine-tuning process aiming at adapting a MLLM originally trained on natural images to the MRI images while effectively integrating the PICG. Specifically, in the first stage, we develop a domain adapter layer tailored for processing 3D MRI inputs and instruct the MLLM to differentiate MRI sequences. In the second stage, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging · Biomedical Text Mining and Ontologies
MethodsAdapter · ALIGN
