# A Machine Learning Model to Improve Risk Adjustment Accuracy in Medicare

**Authors:** Daniel K. Shenfeld, Lindsay Warrenburg, Eli Silvert, Matthew Guido, Maggie Makar, Karen Joynt Maddox, Amol S. Navathe, Ravi Bharat Parikh, Ezekiel J. Emanuel

PMC · DOI: 10.1111/1475-6773.70093 · 2026-03-05

## TL;DR

A new machine learning model called Franklin improves the accuracy of predicting healthcare costs for Medicare beneficiaries compared to the current system.

## Contribution

Franklin, a machine learning model, outperforms the traditional HCC system in predicting Medicare costs and improving risk adjustment accuracy.

## Key findings

- Franklin achieved higher predictive accuracy than HCC with R² log cost of 0.44 versus 0.15.
- Franklin improved accuracy for beneficiaries with few or no HCCs and for racial/ethnic minorities.
- Franklin outperformed HCC in detecting low-cost beneficiaries with higher sensitivity and specificity.

## Abstract

To develop a machine learning (ML) algorithm that improves accuracy compared to the Hierarchical Condition Category (HCC) score used by the Centers for Medicare and Medicaid Services to risk‐adjust payments for > 65 million Americans.

Prognostic study using Medicare claims data to train “Franklin”, an ML algorithm predicting one‐year costs, trained using identical data to HCC. Predictive accuracy was evaluated using R
2 log cost, Spearman rho, and sensitivity and specificity.

Random sample of 2018–2019 Part A and B claims from aged, community‐based enrollees in Traditional Medicare who were not dually eligible and did not have end‐stage renal disease.

The sample consisted of 4,176,666 Medicare beneficiaries (mean [SD] age 74.9 [7.2] years, 55.9% women; 85.9% Non‐Hispanic white, 5.6% African‐American, 3.4% Hispanic). Franklin was more accurate than HCC (R
2 log cost 0.44 vs. 0.15; Spearman rho 0.61 vs. 0.41, p < 0.001 for both). Accuracy improved for the 47% of beneficiaries with 0 HCCs and the 27% of beneficiaries with one HCC (Spearman rho 0.59 vs. 0.08 and 0.46 vs. 0.16, respectively; p < 0.001 for both). Franklin outperformed HCC in detecting the 20% lowest‐cost beneficiaries (sensitivity 0.60 vs. 0.34, specificity 0.90 vs. 0.83; p < 0.001 for both). Franklin improved accuracy over HCC for racial/ethnic minorities and rural‐dwelling beneficiaries (R
2 log cost Black 0.48 vs. 0.14, Hispanic 0.55 vs. 0.09, rural 0.36 v. 0.11; p < 0.001 for all), although Franklin disproportionately classified Black (15.8% vs. 10.1%) and Hispanic (22.9% vs. 12.2%) beneficiaries in the lowest predicted cost decile.

Franklin is an ML risk adjustment model that significantly improves risk‐adjustment accuracy for Medicare beneficiaries compared to HCC. Franklin could generate improvement in payment accuracy, reduction in selection incentives, and financial savings to Medicare. Clarifying the equity impacts of more accurate risk adjustment is necessary.

## Full-text entities

- **Genes:** GCG (glucagon) [NCBI Gene 2641] {aka GLP-1, GLP1, GLP2, GRPP}, PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, MAPT (microtubule associated protein tau) [NCBI Gene 4137] {aka DDPAC, FTD1, FTDP-17, MAPTL, MSTD, MTBT1}
- **Diseases:** Prostate Cancer (MESH:D011471), melanoma (MESH:D008545), HCC (MESH:D020763), Diabetes (MESH:D003920), end-stage renal disease (MESH:D007676), COVID 19 (MESH:D000086382), Cancer (MESH:D009369), Mental health disorders (OMIM:603663), abdominal pain (MESH:D015746), urinary tract infections (MESH:D014552), breast cancer (MESH:D001943), obesity (MESH:D009765), COPD (MESH:D029424), frailty (MESH:D000073496), CMS (MESH:C536089)
- **Chemicals:** HCC (-)
- **Species:** Melegrivirus A (no rank) [taxon 1330070], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** HCC36 — Homo sapiens (Human), Adult hepatocellular carcinoma, Cancer cell line (CVCL_VI90)

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12961717/full.md

---
Source: https://tomesphere.com/paper/PMC12961717