Hybrid assembled genomes of three Salmonella enterica isolates from municipal wastewater in Tennessee, USA
Jia Wang, Lauren K. Hudson, Daniel W. Bryan, Thomas G. Denes

TL;DR
This study presents hybrid genome assemblies of three Salmonella strains isolated from wastewater in Tennessee, identifying their serotypes.
Contribution
The paper provides new hybrid genome assemblies and serotype identification for Salmonella isolates from municipal wastewater.
Findings
Three Salmonella enterica isolates were obtained from a wastewater treatment plant in Knoxville, Tennessee.
Hybrid assemblies revealed two isolates as serotype Bareilly and one as serotype Typhimurium.
Abstract
Three Salmonella enterica isolates were isolated from a municipal wastewater treatment plant in Knoxville, TN, USA. Genomic DNA was extracted and sequenced on the Illumina and Oxford Nanopore platforms. Annotated hybrid assemblies showed the strains were predicted to be serotypes Bareilly (n = 2) and Typhimurium (n = 1).
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Strain | No. of Illumina reads | No. of ONT reads | N50 of ONT reads | Total length (bp) | No. of chromosome contigs | No. of plasmid contigs | GC content (%) | Predicted serotype | Illumina read coverage (×) | Nanopore read coverage (×) | No. of CDSs | No. of tRNAs | No. of CRISPR arrays | Antibiotic resistance genes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UTK W1-0001 | 2,419,110 | 5,000 | 19,045 | 4,964,787 | 1 | 1 | 52.17 | Typhimurium | 66 | 9 | 4,739 | 86 | 4 |
|
| UTK W1-0003 | 3,066,142 | 8,000 | 17,340 | 4,843,513 | 1 | 1 | 52.05 | Bareilly | 90 | 14 | 4,648 | 85 | 3 |
|
| UTK W1-0011 | 2,943,720 | 7,000 | 17,533 | 5,032,162 | 1 | 2 | 52.14 | Bareilly | 83 | 12 | 4,871 | 86 | 2 |
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSalmonella and Campylobacter epidemiology · Genomics and Phylogenetic Studies · Aquaculture disease management and microbiota
ANNOUNCEMENT
Salmonella enterica is a globally widespread bacterial pathogen, causing an estimated 100 million infections and over 200,000 deaths annually (1, 2). Reporting novel S. enterica strains from municipal wastewater contributes genomic resources for comparative analyses and genomic surveillance. This study presents the complete and draft genome sequences of three S. enterica strains (UTK W1-0001, UTK W1-0003, and UTK W1-0011) isolated from municipal wastewater samples collected at a local wastewater treatment plant in Knoxville, Tennessee, USA.
Samples were collected from the primary effluent of municipal wastewater on 9 October 2023. The collected wastewater samples were filtered and enriched for Salmonella spp. following the modified Food and Drug Administration’s Bacteriological Analytical Manual protocol (3, 4). Single colonies were isolated on selective media, and pure cultures were prepared. The presumptive Salmonella isolates were cultured overnight in brain heart infusion broth at 37°C with shaking at 200 rpm. DNA was extracted by the DNeasy Blood and Tissue Kit (Qiagen), assessed with Nanodrop (Thermo Fisher Scientific) and Qubit 4 (Thermo Fisher Scientific), and shipped to SeqCenter (Pittsburgh, PA, USA). Illumina short-read sequencing was performed using the Illumina DNA Prep kit and NovaSeq X Plus sequencer (Illumina), producing 151 bp paired-end reads (5). For Nanopore sequencing, DNA was extracted using the Zymo Quick-DNA HMW MagBead kit (Zymo Research Corp.). The library was prepared using the Oxford Nanopore rapid barcoding kit (SQK-RBK114.24) and sequenced with a MinION sequencer with Flongle (Oxford Nanopore Technologies). Basecalling was performed by MinKNOW (v24.06.5) using the fast basecalling model with adapter trimming, default quality score threshold, and no size-selection (6). Read quality was checked by FastQC v0.11.9 (7), and trimming was performed by Trimmomatic v0.39 (8). The hybrid assembly was performed using Unicycler v0.4.8 in bold mode, including overlap identification and trimming, with rotation to start at dnaA/repA (9). For all software, default parameters were used except where otherwise noted.
BLAST analysis (10) of the 7 kb contig in UTK W1-0011 assembly revealed 96.7% nucleotide identity across 81% of the query sequence length to plasmid NZ_CP128883.1. kSNP4 (v4.1) was used to identify core genome single-nucleotide polymorphism (SNP) among these three Salmonella isolates (11), revealing each was a distinct strain. Genome sequencing data and annotation features are summarized in Table 1. The genomes of strains UTK W1-0001, UTK W1-0003, and UTK W1-0011 were predicted to have 151-152 virulence factor genes. Key virulence determinants included fimbrial and nonfimbrial adherence determinants, macrophage inducible gene mig-14, and type III secretion system (T3SS) encoded by Salmonella pathogenicity island 1 (SPI-1) and 2 (SPI-2). Compared with previously isolated Salmonella enterica strains harboring 100-130 virulence genes identified by VFDB (12–15), the three newly isolated Salmonella enterica strains exhibit a notable virulence profile potentially indicative of aggressive pathogenic characteristics.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Wang BX, Butler DSC, Hamblin M, Monack DM. 2023. One species, different diseases: the unique molecular mechanisms that underlie the pathogenesis of typhoidal Salmonella infections. Curr Opin Microbiol 72:102262. doi:10.1016/j.mib.2022.10226236640585 PMC 10023398 · doi ↗ · pubmed ↗
- 2Sears KT, Galen JE, Tennant SM. 2021. Advances in the development of Salmonella-based vaccine strategies for protection against Salmonellosis in humans. J Appl Microbiol 131:2640–2658. doi:10.1111/jam.1505533665941 PMC 9292744 · doi ↗ · pubmed ↗
- 3Strawn LK, Gröhn YT, Warchocki S, Worobo RW, Bihn EA, Wiedmann M. 2013. Risk factors associated with Salmonella and Listeria monocytogenes contamination of produce fields. Appl Environ Microbiol 79:7618–7627. doi:10.1128/AEM.02831-1324077713 PMC 3837806 · doi ↗ · pubmed ↗
- 4Strawn LK, Fortes ED, Bihn EA, Nightingale KK, Gröhn YT, Worobo RW, Wiedmann M, Bergholz PW. 2013. Landscape and meteorological factors affecting prevalence of three food-borne pathogens in fruit and vegetable farms. Appl Environ Microbiol 79:588–600. doi:10.1128/AEM.02491-1223144137 PMC 3553790 · doi ↗ · pubmed ↗
- 5Wang J, Schamp CN, Hudson LK, Chaggar HK, Bryan DW, Garman KN, Radosevich MA, Denes TG. 2025. Whole-genome sequencing and metagenomics reveals diversity and prevalence of soil Listeria spp. in the Nantahala National Forest. Microbiol Spectr 13:e 0171224. doi:10.1128/spectrum.01712-2439651889 PMC 11705966 · doi ↗ · pubmed ↗
- 6Technologies ON. 2025. Min KNOW. Available from: https://nanoporetech.com/document/experiment-companion-minknow
- 7Andrews S. 2010. Fast QC: a quality control tool for high throughput sequence data, on Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc
- 8Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi:10.1093/bioinformatics/btu 17024695404 PMC 4103590 · doi ↗ · pubmed ↗
