Complete genome sequence of Prevotella histicola T05-04
Kelyah Spurgeon, John R. Erb-Downward, Gary B. Huffnagle, Ariangela J. Kozik

TL;DR
This paper presents the complete genome sequence of Prevotella histicola T05-04, a bacterium originally isolated from human oral squamous cell carcinoma tissue.
Contribution
The novelty is the closed genome assembly of P. histicola T05-04 using Nanopore sequencing.
Findings
The previously available genome for P. histicola T05-04 was fragmented into 109 contigs.
Nanopore sequencing enabled the assembly of a complete, closed genome.
This provides a more accurate reference for future studies on this bacterium.
Abstract
Prevotella histicola is a non-spore-forming, obligately anaerobic, gram-negative coccobacillus originally isolated from human oral squamous cell carcinoma tissue. Previously, the available genome for P. histicola T05-04 consisted of 109 contigs. Here, we report the complete (closed) genome sequence of Prevotella histicola T05-04, assembled using Nanopore sequencing.
Taxonomy
Topics: Genomics and Phylogenetic Studies · Bacterial Identification and Susceptibility Testing · Oral microbiology and periodontitis research
ANNOUNCEMENT
The bacterial genus Prevotella comprises many human-associated species that are highly abundant at different sites of the body. The ecological role of Prevotella histicola as a prominent member of the oral microbiome, and its relationship with other members of the microbiome, have yet to be fully understood. Genome sequencing of the type strain P. histicola T05-04 will aid future investigations of the genomic features underlying the biological role(s) of this member of the human microbiome.
The Prevotella genus (phylum, Bacteroidota; class, Bacteroidia; order, Bacteroidales; family, Prevotellaceae) consists of gram-negative, obligately anaerobic, non-spore-forming bacteria. We purchased lyophilized P. histicola strain T05-04 (DSM 19854) and reconstituted it in pre-reduced medium (Brain Heart Infusion Broth from Anaerobe Systems, cat# AS-872) under anaerobic conditions at 37°C.
Before sequencing, P. histicola T05-04 was cultured in Tryptone Yeast Glucose (TYG) medium (1, 2) at 37°C for 24 hours and then streak plated onto Schaedler Kanamycin Vancomycin agar with 5% sheep's blood and incubated for 24 hours, all under anoxic conditions. A single colony was cultured in TYG medium for 24 hours and then verified by Gram stain, colony PCR, and 16S rRNA Sanger sequencing. DNA was isolated using the DNeasy Blood and Tissue Kit, yielding 208 ng/µL as measured with the Qubit dsDNA HS Assay on a Qubit Fluorometer. The 260/280 (1.89) and 260/230 (1.87) ratios were measured using a Nanodrop One UV-visible spectrophotometer.
DNA library preparation was performed using the Oxford Nanopore Technologies Ligation Sequencing gDNA–Native Barcoding Kit 24 V14 (SQK-NBD114.24, ONT, UK) according to the manufacturer's protocol, and the library was sequenced on a FLO-MIN114 Flow Cell (ONT, UK) for 1.5 hours. DNA fragments of all sizes were enriched with Short Fragment Buffer (SKQ 1481067) without shearing. Dorado 7.4.14, applied with a minimum Q score of 12 after raw reads were generated, yielded 720.55 k reads with a read length N50 of 1,670.
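For readers unfamiliar with the N50 statistic quoted above: it is the length L such that reads of length L or longer together contain at least half of all sequenced bases. The following is a minimal Python sketch of that definition. The function name `read_n50` and the toy read lengths are illustrative assumptions, not code or data from this study.

```python
def read_n50(lengths):
    """Return the N50 of a collection of read lengths.

    N50 is the largest length L such that reads of length >= L
    together contain at least half of all bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest reads first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy read lengths (hypothetical, not taken from the sequencing run).
toy_lengths = [100, 200, 300, 400, 500]
toy_n50 = read_n50(toy_lengths)
print(f"Toy N50: {toy_n50}")
```

In practice the read lengths would be taken from the basecalled FASTQ file rather than typed in by hand.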
The genome sequence was assembled using Flye 2.9.5 (3) with default parameters. Genome polishing was performed using BWA (4) with default parameters and Racon (5) with mismatching and matching bases set to 3 and −3, respectively. A final round of polishing was performed using Medaka (6) with default parameters. The final assembly of P. histicola T05-04 comprised three circular contigs of 2,176,075 bp, 812,616 bp, and 6,399 bp. P. histicola strain F0411 has previously been described as having two chromosomes of different sizes and divergent sequences (NCBI RefSeq assembly GCF_018128125.1), and we found the same to be true for P. histicola T05-04. In addition, the third circular contig (coverage >1,000×) likely represents a plasmid. Each assembled contig was reoriented using dnaapler (7), and the three contigs were visualized with Bandage (8). fastANI (9) with default parameters demonstrated 97% identity to Prevotella histicola. Currently, available Prevotella histicola genomes are assembled to the scaffold or contig level, with only one fully assembled genome available, for a different strain (NCBI RefSeq assembly GCF_018128125.1). This underscores the significance of fully assembling a closed genome for P. histicola T05-04.
The 3 Mbp genome of P. histicola T05-04 has a GC content of 41.5%. Final assembly coverage averaged 232× (contig 1 = 213×, contig 2 = 266×, contig 3 = 2,345×). The genome was annotated using the NCBI Prokaryotic Genome Annotation Pipeline (10), which predicted 2,430 genes in total, of which 2,110 are protein coding.
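The reported totals can be cross-checked from the per-contig figures. The sketch below assumes that the 232× average is weighted by contig length; the text does not state how the average was computed, so this is one plausible reading and not the authors' documented method. Variable names are illustrative.

```python
# Contig lengths (bp) and mean coverage (x) as reported in the text.
contigs = {
    "contig 1": (2_176_075, 213),
    "contig 2": (812_616, 266),
    "contig 3": (6_399, 2_345),
}

# Total assembly size is the sum of the three circular contigs.
total_length = sum(length for length, _ in contigs.values())

# Assumption: the overall average is weighted by contig length.
weighted_coverage = (
    sum(length * cov for length, cov in contigs.values()) / total_length
)

print(f"Total length: {total_length:,} bp")
print(f"Length-weighted mean coverage: {weighted_coverage:.0f}x")
```

Under this assumption the three contigs sum to 2,995,090 bp (about 3 Mbp) and the weighted mean rounds to 232×, matching the figures given above.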
REFERENCES
- 1. Jones DR, Smith MB, McLean R, Grondin JM, Amundsen CR, Inglis GD, Selinger B, Abbott DW. 2019. Engineering dual-glycan responsive expression systems for tunable production of heterologous proteins in Bacteroides thetaiotaomicron. Sci Rep 9:17400. doi:10.1038/s41598-019-53726-w. PMID: 31758019. PMCID: PMC6874557.
- 2. Yokoyama MT, Carlson JR, Holdeman LV. 1977. Isolation and characteristics of a skatole-producing Lactobacillus sp. from the bovine rumen. Appl Environ Microbiol 34:837–842. doi:10.1128/aem.34.6.837-842.1977. PMID: 563703. PMCID: PMC242757.
- 3. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi:10.1038/s41587-019-0072-8. PMID: 30936562.
- 4. Vasimuddin M, Misra S, Li H, Aluru S. 2019. Efficient architecture-aware acceleration of BWA-MEM for multicore systems. 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, p 314–324.
- 5. Vaser R, Sović I, Nagarajan N, Šikić M. 2017. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27:737–746. doi:10.1101/gr.214270.116. PMID: 28100585. PMCID: PMC5411768.
- 6. GitHub - nanoporetech/medaka: sequence correction provided by ONT Research [online]. GitHub. Accessed 20 January 2025. https://github.com/nanoporetech/medaka.
- 7. Bouras G, Grigson SR, Papudeshi B, Mallawaarachchi V, Roach MJ. 2023. Dnaapler: a tool to reorient circular microbial genomes. JOSS 9:5968. doi:10.21105/joss.05968.
- 8. Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi:10.1093/bioinformatics/btv383. PMID: 26099265. PMCID: PMC4595904.
