# P-1758. Informatics-Genomic Pipeline for Large-Scale Extraction and Classification of AMR Genes Using CARD and AMRFinderPlus

**Authors:** Damani Andre, Ali Aslam, Rayven Todd, Muhammed Idris, Rafael Ponce-Terashima, Kenneth Onyedibe, Ubong Aniefiok Udoh, Philip Nwajiobi-Princewill, Ayomide Henrietta Adeyemi

PMC · DOI: 10.1093/ofid/ofaf695.1929 · Open Forum Infectious Diseases · 2026-01-11

## TL;DR

This study compares tools for extracting antimicrobial resistance genes from bacterial genomes, finding AMRFinderPlus more reliable and consistent across operating systems.

## Contribution

A custom Python pipeline was developed to automate AMR gene extraction and characterization, benchmarking RGI and AMRFinderPlus.

## Key findings

- AMRFinderPlus showed consistent performance across Windows, macOS, and Linux systems.
- RGI had installation issues on macOS and Windows but worked reliably when containerized with Docker.
- The pipeline successfully extracted resistance genes from over 4,000 isolates of Klebsiella pneumoniae and Acinetobacter baumannii.

## Abstract

The global rise in antimicrobial resistance (AMR) poses a major threat to public health, requiring robust bioinformatic tools essential for extracting and tracking resistance genes from databases to understand AMR gene transmission and spread. Hence, this study benchmarked AMR gene extraction methods from databases such as Comprehensive Antibiotic Resistance Database (CARD) and AMRFinderPlus.Table 1:Comparison of operating systems seamlessly supported by RGI and AMRFinderPlusFigure 1:Workflow for AMR Gene Extraction from Bacterial Genomes

Comparison of operating systems seamlessly supported by RGI and AMRFinderPlus

Workflow for AMR Gene Extraction from Bacterial Genomes

Genomic sequences of Klebsiella pneumoniae (KP) and Acinetobacter baumannii (AB) were first downloaded using Python codes. To find and categorize the AMR genes we first used Resistance Gene Identifier (RGI), the recommended tool by CARD. We also evaluated KP and AB genome characterization using the AMRFinderPlus, a tool supported by the National Center for Biotechnology Information. We compared the ease of AMR genome characterization across Windows, macOS, and Linux systems. A custom Python pipeline was developed to automate extraction and characterization of AMR gene sequences from genome FASTA files. The workflow was designed to be capable of handling strand orientation, contig structure, and formatting differences across genomes (Figure 1).Figure 2:A detailed breakdown of gene coordinates, contig IDs, and gene annotations is included for representative samples

A detailed breakdown of gene coordinates, contig IDs, and gene annotations is included for representative samples

Resistance Gene Identifier had installation issues on macOS and windows due to strict Python version dependencies. However, it worked reliably on Windows when containerized using Docker. Also, RGI resulted in incomplete annotations, coordinate discrepancies, and file access errors. In contrast, AMRFinderPlus exhibited structured output and consistent performance across Windows, macOS, and Linux systems (Table 1). The final workflow enabled accurate extraction of resistance genes from 1123 AB isolates (10,651 Contig IDs) and 3286 KP isolates (31,406 Contig IDs). Sequences were stratified by gene name, resistance class, contig ID, start-stop location and compiled into structured datasets by Python coding. Representative annotations are shown in Figure 2.

These findings suggest that AMRFinderPlus is a reliable solution for resistance gene characterization which could support broader applications in genomic surveillance and comparative analysis.

All Authors: No reported disclosures

## Linked entities

- **Species:** Klebsiella pneumoniae (taxon 573), Acinetobacter baumannii (taxon 470)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12793628/full.md

---
Source: https://tomesphere.com/paper/PMC12793628