Seven genome sequences of Rothia spp. isolated from human saliva
Allen Choi, Lindsey Pia, Justin R. Kaspar

TL;DR
This paper presents the complete genome sequences of seven Rothia bacteria strains from human saliva, highlighting two species with different genetic compositions.
Contribution
The study provides new genome sequences for understudied Rothia species, expanding knowledge of their genetic diversity.
Findings
Seven Rothia strains were sequenced, including four R. mucilaginosa and three R. dentocariosa.
Genome GC content differed between species, with R. mucilaginosa at 59.5% and R. dentocariosa at 53.8%.
Abstract
Rothia are understudied commensal bacteria within the human oral cavity and respiratory tract and can cause opportunistic infections. We report the complete genome sequences of seven Rothia spp. strains isolated from pooled human saliva, including four strains of Rothia mucilaginosa (59.5% GC) and three strains of Rothia dentocariosa (53.8% GC).
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Strains |
|
| |||||
|---|---|---|---|---|---|---|---|
| SOSUI018 | SOSUI019 | SOSUI028 | SOSUI003 | SOSUI004 | SOSUI007 | SOSUI038 | |
| BioSample |
|
|
|
|
|
|
|
| GenBank accession |
|
|
|
|
|
|
|
| SRA accession | |||||||
| Total Illumina reads | 7,060,312 | 6,835,362 | 6,169,240 | 8,669,170 | 7,634,164 | 6,603,002 | 6,656,362 |
| % bp >Q30 | 94.2 | 94.5 | 94.8 | 93.6 | 92.6 | 94.2 | 93 |
| Illumina coverage (×) | 395 | 386 | 350 | 523 | 457 | 402 | 398 |
| Total Nanopore reads | 390,165 | 387,170 | 312,465 | 237,918 | 259,271 | 430,003 | 295,155 |
| % bp >Q20 | 83.9 | 84.3 | 84.6 | 82.1 | 78.9 | 85.1 | 79.5 |
| 11,583 | 10,988 | 13,469 | 12,746 | 9,715 | 12,981 | 11,374 | |
| Nanopore coverage (×) | 405 | 403 | 382 | 285 | 306 | 523 | 361 |
| Total assembled contigs | 1 | 1 | 1 | 1 | 2 | 1 | 1 |
| Completeness | 95.62 | 95.93 | 95.93 | 99.23 | 98.45 | 98.88 | 99.1 |
| Genome size | 2,500,955 | 2,481,985 | 2,481,987 | 2,297,340 | 2,296,472 | 2,298,761 | 2,302,922 |
| GC % | 53.82 | 53.88 | 53.88 | 59.48 | 59.55 | 59.48 | 59.49 |
| Best match type-strain (% average nucleotide identity) | |||||||
| Annotated genes | 2,166 | 2,158 | 2,152 | 1,811 | 1,818 | 1,814 | 1,805 |
| Protein coding genes | 2,091 | 2,085 | 2,081 | 1,740 | 1,743 | 1,743 | 1,732 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiphtheria, Corynebacterium, and Tetanus · Mycobacterium research and diagnosis · Infections and bacterial resistance
ANNOUNCEMENT
Three species of the genus Rothia are known to colonize human hosts: Rothia aeria, Rothia dentocariosa, and Rothia mucilaginosa. For all three species, fewer than 30 high-quality genomes are publicly available (1). All Rothia species are gram-positive, facultative anaerobes with pleomorphic morphologies (2, 3). Rothia spp. colonizing humans are commonly found in the oral cavity and respiratory tract and are considered to be commensal bacteria that can cause opportunistic infections such as endocarditis, pneumonia, and septicemia (4–6). The role of R. dentocariosa in the development of dental caries is unknown, with the abundance of all Rothia spp. in the oral cavity decreasing as carious lesions progress (7–10).
Recently, our group isolated over 100 strains from commercially available pooled human saliva (11). This included 11 total isolates of R. mucilaginosa and 3 of R. dentocariosa. The first four isolated strains of R. mucilaginosa and all three R. dentocariosa isolates were selected for whole-genome sequencing. Isolates were first resuscitated from a −80°C glycerol stock using a BHI agar plate and grown for 48 h. Afterward, 5 mL of BHI broth was inoculated and grown overnight (37°C and 5% CO_2_). The following morning, genomic DNA was purified with the DNeasy PowerLyzer Microbial Kit (Qiagen; Catalog # 12255-50), and concentration was determined with a Qubit Flex Flurometer (ThermoFisher; Catalog # Q33327) and the Qubit dsDNA BR Assay Kit (ThermoFisher; Catalog # Q32850) according to the supplier’s protocol. The Microbial Genome Sequencing Center performed combined short- and long-read sequencing (Small Nanopore Combo; Illumina and Oxford Nanopore technologies [ONT], respectively) and de novo assembly. Default parameters were used except where otherwise noted. Illumina sequencing was performed on an Illumina NovaSeq X Plus, producing 2 × 151 bp pair-ended reads after libraries were prepared with the Illumina DNA Prep kit. No additional DNA fragmentation or size selection steps were performed. Demultiplexing, quality control, and adapter trimming were performed with bcl-convert (version 4.2.4). Nanopore sequencing, prepared using the Oxford Nanopore Technologies (ONT) Ligation Sequencing Kit (SQK-NBD144.24) using the supplier’s instructions, was performed on an Oxford Nanopore MinION Mk1B sequencer. Guppy (version 6.5.7) was used for basecalling, demultiplexing, and adapter removal (12). Total reads and coverage for each isolate sequenced are listed in Table 1. De novo genome assemblies were generated from ONT read data with Flye (version 2.9.2) using the nano-hq model (13). Subsequent polishing used the Illumina read data with Pilon (version 1.24) under default parameters (14). Assembled contigs were evaluated for circularization via Circulator (version 1.5.5) using the ONT long reads (15). Assembly annotation was performed by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (version 6.9). Finally, assembly statistics were recorded with QUAST (version 5.2.0) (16).
All sequenced strains, except for SOSUI004, were assembled with complete circular genomes. For R. dentocariosa genomes, the total length varies between 2,481,985 and 2,500,955 bp, a GC content percentage of 53.8%, and the number of annotated genes between 2,152 and 2,166. For R. mucilaginosa, the total genome length varies between 2,296,472 and 2,302,922, with a GC content of 59.5% and between 1,805 and 1,818 annotated genes.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1West SR, Suddaby AB, Lewin GR, Ibberson CB. 2024. Rothia. Trends Microbiol 32:720–721. doi:10.1016/j.tim.2024.03.00938580605 · doi ↗ · pubmed ↗
- 2Lesher RJ, Gerencser MA, Gerencser VF. 1974. Morphological, biochemical, and serological characterization of Rothia dentocariosa. Int J Syst Bacteriol 24:154–159. doi:10.1099/00207713-24-2-154 · doi ↗
- 3Brown JM, Georg LK, Waters LC. 1969. Laboratory identification of Rothia dentocariosa and its occurrence in human clinical materials. Appl Microbiol 17:150–156. doi:10.1128/am.17.1.150-156.19694886858 PMC 377629 · doi ↗ · pubmed ↗
- 4Franconieri F, Join-Lambert O, Creveuil C, Auzou M, Labombarda F, Aouba A, Verdon R, de La Blanchardière A. 2021. Rothia spp. infective endocarditis: a systematic literature review. Infectious Diseases Now 51:228–235. doi:10.1016/j.medmal.2020.10.02133164836 · doi ↗ · pubmed ↗
- 5Ramanan P, Barreto JN, Osmon DR, Tosh PK. 2014. Rothia bacteremia: a 10-year experience at mayo clinic, Rochester, Minnesota. J Clin Microbiol 52:3184–3189. doi:10.1128/JCM.01270-1424951810 PMC 4313135 · doi ↗ · pubmed ↗
- 6Zheng X, Liu F, Ma Q, Li J, Ma H. 2024. Rothia dentocariosa endocarditis with brain abscess and splenic abscess: case report and brief review. Front Cardiovasc Med 11:1370736. doi:10.3389/fcvm.2024.137073638966752 PMC 11222609 · doi ↗ · pubmed ↗
- 7Heller D, Helmerhorst EJ, Gower AC, Siqueira WL, Paster BJ, Oppenheim FG. 2016. Microbial diversity in the early in vivo-formed dental biofilm. Appl Environ Microbiol 82:1881–1888. doi:10.1128/AEM.03984-1526746720 PMC 4784052 · doi ↗ · pubmed ↗
- 8Diaz PI, Chalmers NI, Rickard AH, Kong C, Milburn CL, Palmer RJ, Kolenbrander PE. 2006. Molecular characterization of subject-specific oral microflora during initial colonization of enamel. Appl Environ Microbiol 72:2837–2848. doi:10.1128/AEM.72.4.2837-2848.200616597990 PMC 1449052 · doi ↗ · pubmed ↗
