Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology
Peilong Wang, Zhengliang Liu, Yiwei Li, Jason Holmes, Peng Shu, Lian, Zhang, Xiang Li, Quanzheng Li, Brady S. Laughlin, Diego Santos Toesca, Sujay, A. Vora, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

TL;DR
This study demonstrates that fine-tuning open-source large language models with domain-specific data can significantly enhance their performance on key radiation oncology tasks, supporting their potential clinical application.
Contribution
It is the first to fine-tune open-source LLMs for radiation oncology tasks, showing improved accuracy and clinical acceptability over original models.
Findings
Fine-tuned models outperform original models in accuracy and ROUGE-1 scores.
Over 60% of generated treatment regimens were clinically acceptable.
Statistically significant improvements with p-value <= 0.001.
Abstract
Background: The radiation oncology clinical practice involves many steps relying on the dynamic interplay of abundant text data. Large language models have displayed remarkable capabilities in processing complex text information. But their direct applications in specific fields like radiation oncology remain underexplored. Purpose: This study aims to investigate whether fine-tuning LLMs with domain knowledge can improve the performance on Task (1) treatment regimen generation, Task (2) treatment modality selection (photon, proton, electron, or brachytherapy), and Task (3) ICD-10 code prediction in radiation oncology. Methods: Data for 15,724 patient cases were extracted. Cases where patients had a single diagnostic record, and a clearly identifiable primary treatment plan were selected for preprocessing and manual annotation to have 7,903 cases of the patient diagnosis, treatment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment
