Near-complete genome sequence of the opportunistic yeast pathogen Nakaseomyces glabratus environmental isolate D77_3
Roumen Dimitrov, Nicolò Tellini, Matteo De Chiara, Gianni Liti, Dilnora Gouliamova

TL;DR
This paper reports the near-complete genome sequence of a yeast pathogen found in a beetle's gut in Bulgaria.
Contribution
The study provides a new genome sequence for Nakaseomyces glabratus, enhancing understanding of its genetic diversity.
Findings
The genome of Nakaseomyces glabratus isolate D77_3 was de novo assembled.
The isolate was obtained from the gut of the beetle Oxythyrea funesta in Sofia, Bulgaria.
Abstract
We present the near-complete genome sequence of the Nakaseomyces glabratus environmental isolate D77_3, obtained from the gut of the beetle Oxythyrea funesta (Coleoptera: Cetoniinae), collected in Sofia, Bulgaria. This genome was de novo assembled and adds valuable data on N. glabratus diversity.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —Bulgarian National Science Fundhttp://dx.doi.org/10.13039/501100003336
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsYeasts and Rust Fungi Studies · Plant Pathogens and Fungal Diseases · Mycorrhizal Fungi and Plant Interactions
ANNOUNCEMENT
Nakaseomyces glabratus (formerly Candida glabrata) is an opportunistic yeast pathogen causing fatal bloodstream infections (candidemia) in immunocompromised adults (1). We report the near-complete genome of environmental isolate D77_3, obtained from the dissected gut of the beetle Oxythyrea funesta collected in Sofia, Bulgaria. This genome contributes to understanding the evolutionary mechanisms enabling N. glabratus to adapt to diverse environments, including its shift from environmental to human-associated niches. Methods for the strain isolation and identification and accession numbers of ribosomal markers were described previously (2). The isolate is deposited in the Bulgarian National Collection of Microorganisms and Cell Cultures, under accession number NIMCC 9134. The strain is preserved in lyophilized form and maintained at +4°C. It is available from the authors upon request.
The strain was cultured on YPGA medium prior to DNA extraction. Genomic DNA was extracted using the Maxwell RSC Tissue DNA Kit (Promega). For PacBio sequencing, 8 µg DNA was sheared (Covaris) and size-selected with AMPurePB beads. Libraries were prepared using the SMRTbell Template Prep Kit 1.0 (PacBio), and subreads fully contained within other subreads or showing abnormal overlaps were removed. A total of 145,092 PacBio long reads were generated. For Illumina sequencing, 2 µg DNA was sheared (Covaris), end-repaired, A-tailed, and ligated with TruSeq DNA UD adapters (Illumina). Libraries were quantified by qPCR using the KAPA Library Quantification Kit (Roche), and quality was assessed using a Bioanalyzer (Agilent). After adapter trimming and quality filtering (Phred score ≥Q20), 16,454,074 Illumina reads were retained, with 95.88% of bases ≥ Q30.
De novo assembly of PacBio reads was performed using HGAP3 (v3.0) (3), followed by three polishing rounds with Pilon v1.21 (4) using Illumina data. Post-polishing, 16,440,965 Illumina reads mapped to the assembly, covering 99.96% of the genome at an average depth of 182.27×. Completeness was assessed with BUSCO v3.0, using eukaryota_odb9 and Saccharomyces cerevisiae data sets (5) . Assembly quality was evaluated using QUAST v5.0.2 (6) by comparison to the N. glabratus reference genome (GCF_000002545.3_ASM254v2), confirming 99.13% completeness.
Telomeric repeats were detected using the motif TCTGGGTGCTGTGGGG and its reverse complement. Contigs 1, 5, and 8 contained repeats at both ends, indicating complete chromosomes. Alignment confirmed other contigs mostly correspond to full chromosomes, except for terminal telomeric repeats. Structural rearrangements between chromosomes 4 and 12, involving contigs 2, 12, and 13, were identified.
The final assembly totaled 12.65 Mb, with a median PacBio coverage of 76×, an average GC content of 38.98%, and 15 contigs (6 larger than 1,000 bp; N50 = 1,058,520 bp and 9 smaller contigs < 1,000 bp).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Pfaller MA, Diekema DJ. 2007. Epidemiology of invasive candidiasis: a persistent public health problem. Clin Microbiol Rev 20:133–163. doi:10.1128/CMR.00029-0617223626 PMC 1797637 · doi ↗ · pubmed ↗
- 2Gouliamova DE, Dimitrov RA, Smith MT, Groenewald M, Stoilova-Disheva MM, Guéorguiev BV, Boekhout T. 2016. DNA barcoding revealed Nematodospora valgi gen nov., sp. nov. and Candida cetoniae sp. nov. in the Lodderomyces clade. Fungal Biol 120:179–190. doi:10.1016/j.funbio.2015.10.00426781375 · doi ↗ · pubmed ↗
- 3Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi:10.1038/nmeth.247423644548 · doi ↗ · pubmed ↗
- 4Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. P Lo S One 9:e 112963. doi:10.1371/journal.pone.011296325409509 PMC 4237348 · doi ↗ · pubmed ↗
- 5Manni M, Berkeley MR, Seppey M, Zdobnov EM. 2021. BUSCO: assessing genomic data quality and beyond. Curr Protoc 1:e 323. doi:10.1002/cpz 1.32334936221 · doi ↗ · pubmed ↗
- 6Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi:10.1093/bioinformatics/btt 08623422339 PMC 3624806 · doi ↗ · pubmed ↗
