Improving Efficiency of Training a Virtual Treatment Planner Network via Knowledge-guided Deep Reinforcement Learning for Intelligent Automatic Treatment Planning of Radiotherapy
Chenyang Shen, Liyuan Chen, Yesenia Gonzalez, Xun Jia

TL;DR
This paper introduces a knowledge-guided deep reinforcement learning method that significantly accelerates training of a virtual treatment planner network for radiotherapy, enabling efficient development for complex clinical scenarios.
Contribution
The study proposes a novel KgDRL approach that incorporates human expert rules to improve training efficiency of VTPN in radiotherapy planning.
Findings
KgDRL reduces training time from over a week to 13 hours.
VTPN trained with KgDRL achieves comparable plan quality to traditional DRL.
Incorporating human knowledge enhances training efficiency and scalability.
Abstract
We previously proposed an intelligent automatic treatment planning framework for radiotherapy, in which a virtual treatment planner network (VTPN) was built using deep reinforcement learning (DRL) to operate a treatment planning system (TPS). Despite the success, the training of VTPN via DRL was time consuming. Also the training time is expected to grow with the complexity of the treatment planning problem, preventing the development of VTPN for more complicated but clinically relevant scenarios. In this study we proposed a knowledge-guided DRL (KgDRL) that incorporated knowledge from human planners to guide the training process to improve the training efficiency. Using prostate cancer intensity modulated radiation therapy as a testbed, we first summarized a number of rules of operating our in-house TPS. In training, in addition to randomly navigating the state-action space, as in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
