Large Language Models Vote: Prompting for Rare Disease Identification
David Oniani, Jordan Hilsman, Hang Dong, Fengyi Gao, Shiven Verma,, Yanshan Wang

TL;DR
This paper introduces Models-Vote Prompting (MVP), a novel ensemble prompting method that improves rare disease identification from clinical notes using LLMs in Few-Shot Learning settings, supported by a new dataset.
Contribution
The paper proposes MVP, a flexible ensemble prompting approach for LLMs, and releases a new rare disease dataset for FSL, enhancing rare disease detection accuracy.
Findings
MVP outperforms individual LLMs in rare disease classification.
Ensemble voting improves accuracy in Few-Shot Learning scenarios.
Automated evaluation with JSON is feasible for LLM outputs.
Abstract
The emergence of generative Large Language Models (LLMs) emphasizes the need for accurate and efficient prompting approaches. LLMs are often applied in Few-Shot Learning (FSL) contexts, where tasks are executed with minimal training data. FSL has become popular in many Artificial Intelligence (AI) subdomains, including AI for health. Rare diseases affect a small fraction of the population. Rare disease identification from clinical notes inherently requires FSL techniques due to limited data availability. Manual data collection and annotation is both expensive and time-consuming. In this paper, we propose Models-Vote Prompting (MVP), a flexible prompting approach for improving the performance of LLM queries in FSL settings. MVP works by prompting numerous LLMs to perform the same tasks and then conducting a majority vote on the resulting outputs. This method achieves improved results to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
