Genome resource announcement of a Leptodophora sp. fungus isolated from roots of broadleaf plants in Wisconsin, USA
Tristan Yang, Amanda Certano, Venkatachalam Lakshmanan, Sebastian Bassi, Neeraj Purushotham, Tegan Nock, Abed Chaudhury, Animesh Ray

TL;DR
This paper announces the genome of a fungus found in plant roots in Wisconsin, including details about its genetic makeup and potential toxin production.
Contribution
The paper provides a new genome sequence for a Leptodophora sp. fungus with high proteome completeness and identifies secondary metabolism gene clusters.
Findings
The genome consists of 21 contigs with 75.14 mega-base pairs and encodes 19,447 proteins.
The fungus has a 98.4% BUSCO proteome completeness score.
The genome contains 55 secondary metabolism gene clusters, including those for producing potential toxins.
Abstract
We report whole genome sequences of a plant root-associated fungus Leptodophora sp., consisting of 21 contigs representing 75.14 mega-base pairs genome encoding 19,447 proteins, with 98.4% BUSCO proteome completeness score. Secondary metabolism genes constitute 55 clusters. This species likely synthesizes toxic alternapyrone, 6-methylsalicylic acid, and fusarin.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —Loam Bio Pty Ltd. (USA/Australia)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant Pathogens and Fungal Diseases · Plant-Microbe Interactions and Immunity · Genomics and Phylogenetic Studies
ANNOUNCEMENT
The genus Leptodophora (1) (previously, Cadophora (2)) contains endophytic fungi known for their non-pathogenic root association. Some members are dark septate endophytes (3) important for plant nutrition, stress, pathogen deterrence, and community dynamics (4).
A pure fungal sample (TYLB-001) was isolated from broadleaf plant roots in Spooner, WI, by surface sterilizing and plating on potato dextrose agar (PDA, Sigma) with Penicillin−Streptomycin−Neomycin (Sigma, P4083) and subculturing (25°C for several weeks). DNA was isolated from liquid cultures by grinding in liquid nitrogen, lysed with SDS (1% v/w), treated with ribonuclease A and proteinase K, and extracted with chloroform:isoamyl alcohol (24:1, v/v); DNA was precipitated in ethanol (5). DNA libraries were prepared at the University of Minnesota Genomics Center (St. Paul, MN; PacBio HiFi long-read sequencing) (6), and at the Australian Genome Research Facility (for Illumina short-read using Nextera DNA Flex Library Preparation kit). Long-read size selection was by High Pass Plus (>15 kb) and sequencing was on PacBio Sequel II with 8M SMRT Cell. HiFi reads were generated using Circular Consensus Sequencing, which are already quality-filtered and error-corrected; additional trimming or correction was unnecessary. A total of 324,971 HiFi reads were generated, with N50 = 15,785 bp.
Illumina paired-end reads were trimmed using fastp v.0.23.2 (7); 20,796,372 paired-end reads were assembled. Jellyfish 2.2.8 (8) and GenomeScope 2.0 (9) estimated initial genome sizes (k-mer frequency analysis). Short-read sequence data captured 96.9% of the genome by GenomeScope v.2.0 model fitting. HiFiasm v.0.19.9-r616 (10) assembled 324,971 long-reads; scaffolded by SSPACE-Long v.1.1 (11); gaps closed by LR_Gapcloser v.2018.9.4 (12); trimmed paired-end reads were aligned to the gap-closed assembly using Burrows-Wheeler transformation algorithm v.0.7.17-r1188, and the resulting SAM file was processed in SAMTools v.1.12. Pilon v.1.24 (13) was used to polish and gap-fill short reads. Repeats were “softmasked” by RepeatMasker v.4.1.2-pl on the DfAM-3.9 (14, 15) “slow search” and “Fungi” species options. The HiFi reads covered 90.8% of the genome.
The final genome was 75,142,167 bp in 21 contigs (N50 = 5,880,590 bp, 45.53% GC). Protein-coding genes were predicted by Funannotate v.1.8.17 (16) with AUGUSTUS (17), GeneMark-ES-2 v.1.14_1.25 (18), SNAP v.2.2.4-14ubuntu2 (19), and GlimmerHMM v.3.0.4 (20). Gene models produced a consensus by Evidence Modeler v.2.1.0 (21). Diamond v.2.1.11 (22) and BEDtools 2.31.0 (23) filtered low-quality models. A total of 19,447 protein-coding genes were found. tRNAscan-SE v.2.0 predicted tRNA genes (24). Phylogenetic analysis was performed using the Internal Transcribed Spacer (ITS) region spanning ITS1F (5′CTTGGTCATTTAGAGGAAGTAA3′) and ITS4 (5′TCCTCCGCTTATTGATATGC3′) aligned via NCBI BLAST. This placed the fungus in the genus Leptodophora. BUSCO v.2.0 (25) on dikarya_odb9 (26) produced 98.4% proteome completeness score. We predicted biosynthetic gene clusters with antiSMASH v.6.1.1 (27) and found 55 clusters, including those likely to synthesize alternapyrone, 6-methylsalicylic acid, and fusarin. The complete analytical pipeline is available at https://doi.org/10.17504/protocols.io.e6nvw14nwlmk/v5.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Topilina YS, Luk’yanova EA, Glukhova LB, Shurupova MN, Gerasimchuk AL, Frank YA, Antsiferov DV. 2023. Beneficial effect of the new Leptodophora sp. strain on development of blueberry microclones in the process of their adaptation. Microorganisms 11:1406. doi:10.3390/microorganisms 1106140637374907 PMC 10301985 · doi ↗ · pubmed ↗
- 2Koukol O. 2022. Leptodophora gen. nov. (Helotiales, Leotiomycetes) proposed to accommodate selected root-associated members of the genus Cadophora. Czech Mycol 74:57–66. doi:10.33585/cmy.74104 · doi ↗
- 3Daghino S, Martino E, Voyron S, Perotto S. 2022. Metabarcoding of fungal assemblages in Vaccinium myrtillus endosphere suggests colonization of above-ground organs by some ericoid mycorrhizal and DSE fungi. Sci Rep 12:11013. doi:10.1038/s 41598-022-15154-135773465 PMC 9246922 · doi ↗ · pubmed ↗
- 4Mandyam K, Jumpponen A. 2005. Seeking the elusive function of the root-colonising dark septate endophytic fungi. Stud Mycol 53:173–189. doi:10.3114/sim.53.1.173 · doi ↗
- 5Jones A, Purushotham N, Nasim J, Schwessinger B. 2020. DNA clean-up and size selection for long-read sequencing. Available from: https://www.protocols.io/view/dna-clean-up-and-size-selection-for-long-read-sequ-betdjei 610.1371/journal.pone.0253830 PMC 828202834264958 · doi ↗ · pubmed ↗
- 6Wenger AM, Peluso P, Rowell WJ, Chang P-C, Hall RJ, Concepcion GT, Ebler J, Fungtammasan A, Kolesnikov A, Olson ND, et al.. 2019. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat Biotechnol 37:1155–1162. doi:10.1038/s 41587-019-0217-931406327 PMC 6776680 · doi ↗ · pubmed ↗
- 7Chen S, Zhou Y, Chen Y, Gu J. 2018. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i 884–i 890. doi:10.1093/bioinformatics/bty 56030423086 PMC 6129281 · doi ↗ · pubmed ↗
- 8Marçais G, Kingsford C. 2011. A fast, lock-free approach for efficient parallel counting of occurrences of k -mers. Bioinformatics 27:764–770. doi:10.1093/bioinformatics/btr 01121217122 PMC 3051319 · doi ↗ · pubmed ↗
