A learning-driven automatic planning framework for proton PBS treatments of H&N cancers
Qingqing Wang, Liqiang Xiao, Chang Chang

TL;DR
This paper introduces a novel learning-driven framework combining inverse optimization and reinforcement learning to automate and improve proton PBS treatment planning for head & neck cancers, achieving faster and higher-quality plans.
Contribution
It develops a scalable L2O inverse optimizer using long-context techniques and integrates it with a PPO-based planner, advancing automated treatment planning methods.
Findings
Improves inverse optimization efficiency by 23-36%.
Generates treatment plans within 2.55 hours on average.
Achieves comparable or better organ-at-risk sparing and target coverage.
Abstract
Proton pencil beam scanning (PBS) treatment planning for head & neck (H&N) cancers involves numerous conflicting objectives, requiring iterative objective parameter adjustments to balance multiple clinical goals. We propose a learning-driven inverse optimizer and integrate it into a proximal policy optimization (PPO)-based planning framework to automatically generate high-quality plans for patients with diverse treatment requirements. The inverse optimizer is a learning-to-optimize (L2O) method that predicts update steps by learning from task-specific data distributions. For the first time, long-context processing techniques developed for large language models (LLMs) are utilized to address the scalability limitations of existing L2O methods, enabling simultaneous optimization over a substantially large set of variables. The PPO framework functions as an outer-loop virtual planner,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Therapy and Dosimetry
