Transforming Multimodal Models into Action Models for Radiotherapy

Matteo Ferrante; Alessandra Carosi; Rolando Maria D Angelillo; Nicola; Toschi

arXiv:2502.04408·cs.LG·February 10, 2025

Transforming Multimodal Models into Action Models for Radiotherapy

Matteo Ferrante, Alessandra Carosi, Rolando Maria D Angelillo, Nicola, Toschi

PDF

Open Access

TL;DR

This paper introduces a novel approach that transforms large multimodal foundation models into action models for radiotherapy treatment planning, improving efficiency and plan quality through few-shot reinforcement learning and simulation.

Contribution

It presents a new framework that leverages pre-trained multimodal models and adapts them for treatment planning using few-shot RL, enhancing accuracy and efficiency.

Findings

01

Outperforms traditional RL methods in treatment plan quality.

02

Achieves higher reward scores and better dose distributions in simulations.

03

Demonstrates potential for clinical workflow integration.

Abstract

Radiotherapy is a crucial cancer treatment that demands precise planning to balance tumor eradication and preservation of healthy tissue. Traditional treatment planning (TP) is iterative, time-consuming, and reliant on human expertise, which can potentially introduce variability and inefficiency. We propose a novel framework to transform a large multimodal foundation model (MLM) into an action model for TP using a few-shot reinforcement learning (RL) approach. Our method leverages the MLM's extensive pre-existing knowledge of physics, radiation, and anatomy, enhancing it through a few-shot learning process. This allows the model to iteratively improve treatment plans using a Monte Carlo simulator. Our results demonstrate that this method outperforms conventional RL-based approaches in both quality and efficiency, achieving higher reward scores and more optimal dose distributions in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman-Automation Interaction and Safety · Robotics and Automated Systems · Speech and dialogue systems