Assessing and Enhancing Large Language Models in Rare Disease Question-answering
Guanchu Wang, Junhao Ran, Ruixiang Tang, Chia-Yuan Chang and, Chia-Yuan Chang, Yu-Neng Chuang, Zirui Liu, Vladimir Braverman and, Zhandong Liu, Xia Hu

TL;DR
This paper evaluates the performance of large language models in diagnosing rare diseases, introduces a new dataset for assessment, and proposes a retrieval-augmented approach to improve accuracy and trustworthiness.
Contribution
It introduces the ReDis-QA dataset for rare disease diagnosis evaluation and the ReCOP corpus for retrieval augmentation, enhancing LLM performance in this domain.
Findings
LLMs struggle with rare disease diagnosis.
ReCOP improves LLM accuracy by 8%.
Retrieval augmentation guides trustworthy answers.
Abstract
Despite the impressive capabilities of Large Language Models (LLMs) in general medical domains, questions remain about their performance in diagnosing rare diseases. To answer this question, we aim to assess the diagnostic performance of LLMs in rare diseases, and explore methods to enhance their effectiveness in this area. In this work, we introduce a rare disease question-answering (ReDis-QA) dataset to evaluate the performance of LLMs in diagnosing rare diseases. Specifically, we collected 1360 high-quality question-answer pairs within the ReDis-QA dataset, covering 205 rare diseases. Additionally, we annotated meta-data for each question, facilitating the extraction of subsets specific to any given disease and its property. Based on the ReDis-QA dataset, we benchmarked several open-source LLMs, revealing that diagnosing rare diseases remains a significant challenge for these models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
