RuleAlign: Making Large Language Models Better Physicians with   Diagnostic Rule Alignment

Xiaohan Wang; Xiaoyan Yang; Yuqi Zhu; Yue Shen; Jian Wang; Peng Wei,; Lei Liang; Jinjie Gu; Huajun Chen; Ningyu Zhang

arXiv:2408.12579·cs.CL·August 23, 2024

RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment

Xiaohan Wang, Xiaoyan Yang, Yuqi Zhu, Yue Shen, Jian Wang, Peng Wei,, Lei Liang, Jinjie Gu, Huajun Chen, Ningyu Zhang

PDF

Open Access

TL;DR

RuleAlign enhances large language models' diagnostic accuracy by aligning them with medical rules, improving their ability to simulate physician reasoning and patient interaction.

Contribution

The paper introduces RuleAlign, a novel framework that aligns LLMs with diagnostic rules using preference learning and a new medical dialogue dataset.

Findings

01

Improved diagnostic reasoning in LLMs

02

Effective alignment with medical rules demonstrated

03

Potential to serve as AI physicians

Abstract

Large Language Models (LLMs) like GPT-4, MedPaLM-2, and Med-Gemini achieve performance competitively with human experts across various medical benchmarks. However, they still face challenges in making professional diagnoses akin to physicians, particularly in efficiently gathering patient information and reasoning the final diagnosis. To this end, we introduce the RuleAlign framework, designed to align LLMs with specific diagnostic rules. We develop a medical dialogue dataset comprising rule-based communications between patients and physicians and design an alignment learning approach through preference learning. Experimental results demonstrate the effectiveness of the proposed approach. We hope that our work can serve as an inspiration for exploring the potential of LLMs as AI physicians.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Healthcare

MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Absolute Position Encodings