# Evaluation of a machine learning system for genomic antimicrobial susceptibility determination on a clinically representative test set

**Authors:** Jason D. Wittenbach, Arolyn Conwill, Hayden Sansum, Alison Gassett, Adam Gardner, Allison Brookhart, Talia Hollowell, Paul Knysh, Nicholas B. Worley, Nicole Billings, Ian C. Herriott, Julie A. Shimabukuro, Kathleen A. Quan, Keith M. Madey, Susan S. Huang, Mohamad R. A. Sater, Cassiana E. Bittencourt, Miriam H. Huntley

PMC · DOI: 10.1128/spectrum.00564-25 · Microbiology Spectrum · 2026-01-27

## TL;DR

A new machine learning system accurately predicts antibiotic resistance from bacterial genomes, outperforming traditional methods.

## Contribution

The paper introduces Keynome gAST, a machine learning system for genomic AST with high accuracy across diverse species-drug combinations.

## Key findings

- Keynome gAST achieved 96.9% categorical agreement with phenotypic AST results across 97 species-drug combinations.
- Keynome gAST outperformed ResFinder with 96.0% binary accuracy versus 83.5%.
- ResFinder's limitations were due to low sensitivity and specificity in identifying resistance markers.

## Abstract

Next-generation sequencing is anticipated to transform infectious disease treatment by enabling faster diagnosis, yet a clinical-grade method for genomic AST determination is still lacking. We assessed the accuracy of Keynome gAST, a k-mer based machine learning system for genomic AST, on 956 clinical bacterial isolates. We compared 7,801 predictions of susceptible (S), intermediate (I), and resistant (R) across 97 species-drug combinations against phenotypic AST results. Across the full data set, the Keynome gAST Qualified panel achieved a categorical agreement of 96.9% with very major and major error rates of 1.4% and 0.8%, respectively. At the level of individual species-drug combinations with sufficient data to assess, performance was comparable to the aggregate, with median categorical agreement of 97.4%. We also compared the performance of Keynome gAST to ResFinder, a simple resistance marker approach, in distinguishing susceptible vs non-susceptible samples. On the full data set, Keynome gAST performance was significantly better than that of ResFinder (binary accuracy of 96.0% vs 83.5%). When analyzed at the species-drug level, ResFinder’s poor performance was seen to be driven by either low sensitivity due to phenotypically resistant samples lacking resistance markers (18.0% of such samples) or low specificity due to the presence of resistance markers in phenotypically susceptible samples (15.1% of such samples). In total, these results demonstrate the dual advantages of machine learning and whole-genome profiling that Keynome gAST has over simple resistance marker approaches, enabling high accuracy genomic AST across a range of clinically relevant species-drug combinations.

Microbial genome sequencing presents an exciting opportunity for rapid diagnosis of infectious diseases, but interpreting the resulting data for clinical use remains a challenge. We report a new machine learning method that predicts a bacterial strain's antibiotic resistance profile based solely on its genomic sequence. This method could lead to new, faster diagnostic tools that quickly identify the most effective antibiotic therapy.

## Full-text entities

- **Genes:** SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}
- **Diseases:** infectious disease (MESH:D003141)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12955427/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12955427/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC12955427/full.md

---
Source: https://tomesphere.com/paper/PMC12955427