Interpretable Differential Diagnosis with Dual-Inference Large Language Models
Shuang Zhou, Mingquan Lin, Sirui Ding, Jiashuo Wang, Genevieve B., Melton, James Zou, Rui Zhang

TL;DR
This paper introduces a novel framework using large language models for interpretable differential diagnosis, creating a curated dataset and demonstrating improved accuracy and explanation capabilities in clinical settings.
Contribution
It presents the first dataset with expert interpretations for DDx and a dual-inference framework enabling bidirectional reasoning with LLMs for better interpretability.
Findings
Dual-Inf reduces interpretation errors in DDx.
It improves explanation quality for rare diseases.
Validated across four large language models.
Abstract
Automatic differential diagnosis (DDx) is an essential medical task that generates a list of potential diseases as differentials based on patient symptom descriptions. In practice, interpreting these differential diagnoses yields significant value but remains under-explored. Given the powerful capabilities of large language models (LLMs), we investigated using LLMs for interpretable DDx. Specifically, we curated the first DDx dataset with expert-derived interpretation on 570 clinical notes. Besides, we proposed Dual-Inf, a novel framework that enabled LLMs to conduct bidirectional inference (i.e., from symptoms to diagnoses and vice versa) for DDx interpretation. Both human and automated evaluation validated its efficacy in predicting and elucidating differentials across four base LLMs. In addition, Dual-Inf could reduce interpretation errors and hold promise for rare disease…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsBalanced Selection
