Quantify unmet medical need across the disease landscape – A large language model-based methodology

Elliott W. Sharp; Nicholas Fragola; Charlotte Blewitt; Matthew Goddeeris; Lee Lancashire; Charlie Hempstead; David C. Fajgenbaum

PMC · DOI:10.1371/journal.pmed.1004798·March 12, 2026

Quantify unmet medical need across the disease landscape – A large language model-based methodology

Elliott W. Sharp, Nicholas Fragola, Charlotte Blewitt, Matthew Goddeeris, Lee Lancashire, Charlie Hempstead, David C. Fajgenbaum

PDF

Open Access

TL;DR

This paper introduces a scalable AI method to quantify unmet medical need across all human diseases, enabling faster and cheaper prioritization of research efforts.

Contribution

A novel LLM-based framework to systematically score unmet medical need across 22,701 diseases using 11 criteria, enabling rapid and low-cost prioritization.

Findings

01

The LLM-based system scored all 22,701 diseases in ~1 hour at a cost of $120 USD.

02

LLM scores showed strong agreement with expert rankings (e.g., Spearman’s rho of 0.845 for mortality).

03

The method is flexible, allowing organizations to adjust criteria weights based on their priorities.

Abstract

Despite the ultimate goal of medical researchers and funders being to maximize patient benefit, there is no systematic process for quantifying unmet medical need across diseases. While a relative unmet medical need scoring system would be valuable for prioritization of medical research, systematically performing this effort across all 22,701 human diseases is technically challenging, time-consuming, and expensive. Using a large language model-based (LLM) architecture, we built a scalable method demonstrating feasibility to quantify “unmet medical need” criteria across all diseases, combine those criteria into a single weighted score, and extend the method into new criteria or diseases in the future. We aimed to quantitatively determine which diseases have the greatest unmet medical need and, therefore, which diseases are priority targets for new repurposed treatments. We defined 11…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Figures10

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHealth Systems, Economic Evaluations, Quality of Life · Machine Learning in Healthcare · Chronic Disease Management Strategies