# Measuring representation in clinical trials: Simulations demonstrating how current methods fail in the context of precision medicine

**Authors:** Andrew Friedson, Abigail Humphreys, June Cha

PMC · DOI: 10.1371/journal.pone.0342711 · 2026-03-10

## TL;DR

This paper shows that current methods for measuring how well clinical trials represent diverse populations fail when applied to precision medicine, where treatments target specific subgroups.

## Contribution

The paper introduces simulations to demonstrate how existing representativeness metrics break down when comparing to demographically distinct populations in precision medicine.

## Key findings

- 24%, 40%, and 32% of trials gave different representativeness results with 5 percentage point demographic shifts for sex, race, and ethnicity.
- Larger demographic differences between registry and simulated populations led to worse performance of representativeness metrics.
- Current methods are inaccurate for precision medicine contexts targeting specific subgroups.

## Abstract

Clinical trial representativeness is vital for the evaluation of intervention performance and study generalizability. Current methods for evaluating representativeness are limited by the quality of the disease registry data used and may not appropriately evaluate studies aimed at precision cohorts. This study evaluates the sensitivity of existing methods for measuring study representation to differences between the population targeted by the clinical intervention and the closest available population in a patient registry. Using records for U.S.-based cancer clinical trials registered to ClinicalTrials.gov from 2017–2023 and the U.S. Cancer Statistics Public Use Database we calculated representativeness measures by comparing the demographic mix (based on sex, race, and ethnicity) in each clinical trial to the demographic mix for the same form of cancer in the U.S. Cancer Statistics database via exact Binomial tests. Then, the same tests were conducted comparing trial populations to simulated populations that were demographically different from the U.S. Cancer Statistics data by fixed percentages. The outcome of interest was whether the result of the test changed when the comparator population was different. For clinical trials reporting the sex, race, and ethnicity of participants, 24, 40, and 32 percent of studies (respectively) give different results when the difference between the registry population and simulated population is 5 percentage points. For all demographics, larger differences between the registry and the simulated populations were associated with worse metric performance. Analyses of clinical trial representativeness suffer from a large loss of accuracy in settings where treatments are targeted to demographically different subgroups. Studies of representation in the context of precision medicine should be interpreted with caution.

## Full-text entities

- **Genes:** CYP2C19 (cytochrome P450 family 2 subfamily C member 19) [NCBI Gene 1557] {aka CPCJ, CYP2C, CYPIIC17, CYPIIC19, P450C2C, P450IIC19}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}
- **Diseases:** colorectal cancer (MESH:D015179), cardiovascular illnesses (MESH:D002318), breast cancer (MESH:D001943), TNBC (MESH:D064726), cancers of the cervix, uterus, ovary, prostate and testis (MESH:D010051), prostate cancer (MESH:D011471), Cancer (MESH:D009369), single- (MESH:D012640)
- **Chemicals:** Clopidogrel (MESH:D000077144)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12974884/full.md

---
Source: https://tomesphere.com/paper/PMC12974884