From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT
Jace Grandinetti, Rafe McBeth

TL;DR
This paper presents ARCoT, a retrieval-based framework that enhances large language models' accuracy in medical physics by integrating domain-specific information and reasoning techniques, outperforming standard models without fine-tuning.
Contribution
ARCoT introduces a novel retrieval and prompting approach that improves domain-specific reasoning in LLMs without requiring model fine-tuning.
Findings
ARCoT outperformed standard LLMs and matched human performance on medical physics exams.
The method achieved up to 68% improvement in accuracy.
ARCoT reduced hallucinations and increased domain-specific reliability.
Abstract
Large Language Models (LLMs) have achieved remarkable progress, yet their application in specialized fields, such as medical physics, remains challenging due to the need for domain-specific knowledge. This study introduces ARCoT (Adaptable Retrieval-based Chain of Thought), a framework designed to enhance the domain-specific accuracy of LLMs without requiring fine-tuning or extensive retraining. ARCoT integrates a retrieval mechanism to access relevant domain-specific information and employs step-back and chain-of-thought prompting techniques to guide the LLM's reasoning process, ensuring more accurate and context-aware responses. Benchmarking on a medical physics multiple-choice exam, our model outperformed standard LLMs and reported average human performance, demonstrating improvements of up to 68% and achieving a high score of 90%. This method reduces hallucinations and increases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies
