# GWAS and Machine Learning Screening of Genomic Determinants Underlying Host Adaptation in Swine and Chicken Salmonella Typhimurium Isolates

**Authors:** Yifan Liu, Yuhao Wang, Yaxi Wang, Xiao Liu, Shuang Wang, Yao Peng, Ziyu Liu, Zhenpeng Li, Xin Lu, Biao Kan

PMC · DOI: 10.3390/microorganisms14020293 · 2026-01-27

## TL;DR

This study identifies genetic features in Salmonella Typhimurium that help it adapt to pigs and chickens, using genomics and machine learning.

## Contribution

The study combines GWAS and interpretable machine learning to reveal host-specific genomic signatures in Salmonella Typhimurium.

## Key findings

- Chicken-adapted S. Typhimurium subpopulations show lower genetic diversity and distinct virulence factors.
- GWAS identified 1878 host-associated genes and SNPs/indels linked to metabolism and regulation.
- A Random Forest model accurately distinguished swine and chicken origins with an AUC of 0.974.

## Abstract

Salmonella Typhimurium is a major zoonotic pathogen, with pigs and chickens serving as key reservoirs for human infection, yet the genomic determinants of its host adaptation remain incompletely understood. This study integrated comparative genomics, genome-wide association studies (GWASs), and interpretable machine learning on 1654 high-quality genomes of swine- and chicken-origin S. Typhimurium isolates to identify host-associated genetic features. Phylogenetic analysis revealed host-preferred lineages and significantly lower genetic diversity within chicken-adapted subpopulations. Meta-analysis identified distinct host-associated profiles of antimicrobial resistance genes (e.g., higher prevalence of floR and blaTEM-1 in swine) and virulence factors (e.g., enrichment of allB and the yersiniabactin system in chickens). GWASs pinpointed 1878 host-associated genes and multiple SNPs/indels, functionally enriched in metabolism, regulation, and cell processes. A two-stage Random Forest model, built using the most contributory features, accurately discriminated between swine and chicken origins (AUC = 0.974). These findings systematically revealed the genomic signatures of host adaptation in S. Typhimurium, providing a prioritized set of candidate markers for experimental validation.

## Linked entities

- **Genes:** floR (chloramphenicol/florfenicol efflux MFS transporter FloR) [NCBI Gene 57334229], allB (allantoinase) [NCBI Gene 945134]

## Full-text entities

- **Diseases:** toxicity (MESH:D064420), infection (MESH:D007239), gastrointestinal infection (MESH:D005767), deaths (MESH:D003643), MGEs (MESH:D014086), Salmonella (MESH:D012480), VF (MESH:C537182), gastroenteritis (MESH:D005759), inflammation (MESH:D007249), injury to (MESH:D014947)
- **Chemicals:** chloramphenicol (MESH:D002701), florfenicol (MESH:C035534), beta-lactam (MESH:D047090), LPS (MESH:D008070), purine (MESH:C030985), ARG (MESH:D001120), trimethoprim (MESH:D014295), tetracycline (MESH:D013752), aadA2 (-), allantoin (MESH:D000481), iron (MESH:D007501), sialic acid (MESH:D019158), carbon (MESH:D002244), streptomycin (MESH:D013307), ceftiofur (MESH:C053503), sulfonamide (MESH:D013449), nitrogen (MESH:D009584), aminoglycoside (MESH:D000617), yersiniabactin (MESH:C104398), metal (MESH:D008670), pyruvate (MESH:D019289), spectinomycin (MESH:D000198)
- **Species:** Gallus gallus (bantam, species) [taxon 9031], Bos taurus (bovine, species) [taxon 9913], Homo sapiens (human, species) [taxon 9606], Salmonella enterica subsp. enterica serovar Typhimurium (no rank) [taxon 90371], Streptococcus suis (species) [taxon 1307], Sus scrofa (pig, species) [taxon 9823], Salmonella enterica (species) [taxon 28901], Salmonella enterica subsp. enterica serovar Enteritidis (no rank) [taxon 149539], Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 (strain) [taxon 99287], Pseudomonas aeruginosa (species) [taxon 287], Escherichia coli (E. coli, species) [taxon 562]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943566/full.md

---
Source: https://tomesphere.com/paper/PMC12943566