Contamination of fungal genomes of Onygenaceae (Phylum Ascomycota) in public databases: incidence, detection, and impact
Alan Omar Granados-Casas, Ana Fernández-Bravo, Alberto Miguel Stchigel, José Francisco Cano-Lira

TL;DR
This study finds contamination in public fungal genomes from the Onygenaceae family and shows how removing it improves genome quality and reliability.
Contribution
The study introduces a contamination screening and removal workflow for fungal genomes, specifically in the Onygenaceae family.
Findings
Four Onygenaceae genomes had contamination levels between 5 and 12%, mostly bacterial.
Contamination removal reduced contamination levels to below 3% in all cases.
Filtered assemblies showed improved genome quality and fewer bacterial protein families.
Abstract
Genomic datasets often contain unwanted, foreign, or erroneous nucleotide sequences that do not belong to the organism under study. Such contamination can significantly compromise genome analyses, reducing the accuracy and reliability of the results. Despite its potential impact, few studies have addressed the contamination of fungal genomes by exogenous sequences. Here, we analyzed eleven publicly available genomes of fungi from the family Onygenaceae, retrieved from the National Center for Biotechnology Information (NCBI) database. A comprehensive quality assessment was performed, evaluating genome completeness, contiguity, and contamination levels. Genomes with lower statistical quality and putatively contaminated were selected for further improvement. To enhance assembly quality, we built a custom Kraken 2 database including four high-quality genomes of closely related fungal taxa.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Fungal and yeast genetics research · Protist diversity and phylogeny
