From Generalist to Specialist: Improving Large Language Models for   Medical Physics Using ARCoT

Jace Grandinetti; Rafe McBeth

arXiv:2405.11040·cs.CL·May 21, 2024

From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT

Jace Grandinetti, Rafe McBeth

PDF

Open Access

TL;DR

This paper presents ARCoT, a retrieval-based framework that enhances large language models' accuracy in medical physics by integrating domain-specific information and reasoning techniques, outperforming standard models without fine-tuning.

Contribution

ARCoT introduces a novel retrieval and prompting approach that improves domain-specific reasoning in LLMs without requiring model fine-tuning.

Findings

01

ARCoT outperformed standard LLMs and matched human performance on medical physics exams.

02

The method achieved up to 68% improvement in accuracy.

03

ARCoT reduced hallucinations and increased domain-specific reliability.

Abstract

Large Language Models (LLMs) have achieved remarkable progress, yet their application in specialized fields, such as medical physics, remains challenging due to the need for domain-specific knowledge. This study introduces ARCoT (Adaptable Retrieval-based Chain of Thought), a framework designed to enhance the domain-specific accuracy of LLMs without requiring fine-tuning or extensive retraining. ARCoT integrates a retrieval mechanism to access relevant domain-specific information and employs step-back and chain-of-thought prompting techniques to guide the LLM's reasoning process, ensuring more accurate and context-aware responses. Benchmarking on a medical physics multiple-choice exam, our model outperformed standard LLMs and reported average human performance, demonstrating improvements of up to 68% and achieving a high score of 90%. This method reduces hallucinations and increases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies