Genome sequence of the yeast Cyberlindnera sylvatica UCD1060, isolated from soil in Ireland
Padraic G. Heneghan, Adam P. Ryan, Sean A. Bergin, Margarita Caka, Aimee Coughlan, Julianna Decuseara, Michalina Gora, Daniil Hudov, Aoife McLoughlin, Sheila Osadolor, Isabel Saeed Maguire, Weronika Swistowska, Kenneth H. Wolfe, Geraldine Butler, Kevin P. Byrne

TL;DR
This paper reports the genome sequence of a yeast species, Cyberlindnera sylvatica, isolated from soil in Ireland.
Contribution
The study provides the first genome sequence of Cyberlindnera sylvatica strain UCD1060.
Findings
The genome of C. sylvatica UCD1060 is 13.5 Mb in size.
It was assembled into seven chromosome-sized contigs and a mitochondrial genome contig.
Abstract
Cyberlindnera sylvatica is a member of the Cyberlindnera clade in the order Phaffomycetales. We present the genome sequence of C. sylvatica. The sequenced strain, UCD1060, was isolated from soil in Wicklow, Ireland. This genome is 13.5 Mb and was assembled into seven chromosome-sized contigs plus a mitochondrial genome contig.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsYeasts and Rust Fungi Studies · Plant Pathogens and Fungal Diseases · Mycorrhizal Fungi and Plant Interactions
ANNOUNCEMENT
Cyberlindnera sylvatica is a recently characterized (1) species in the budding yeast order Phaffomycetales. It was first isolated in Hungary and Germany from forests. It has no known food or biotechnological applications. As it grows at 37°C, it could potentially be associated with mammals (1).
C. sylvatica UCD1060 was isolated from soil collected from the base of a large oak in the Devil’s Glen, County Wicklow, Ireland (GPS coordinates 53.0163025,–6.1524822) as part of an undergraduate research module (2).
Soil material was passaged twice at room temperature in 9 mL liquid yeast extract-peptone-dextrose (YPD) containing chloramphenicol (30 µg/mL) and ampicillin (100 µg/mL) and cultured on YPD agar plates. The species was identified by PCR and Sanger sequencing of the ribosomal DNA internal transcribed spacer (ITS) and D1/D2 regions (accession numbers PQ438548 and PQ438569), using primers ITS1 (TCCGTAGGTGAACCTGCGG) and ITS4 (TCCTCCGCTTATTGATATGC) (3), and for the D1/D2 region NL1 (GCATATCAATAAGCGGAGGAA) and NL4 (GGTCCGTGTTTCAAGACGG) (4). Sequence identity was 98.36% (540/549 bp) in the ITS and 99.96% (555/557 bp) in the D1/D2 region to the type strain of C. sylvatica (accession numbers MT305876 and MT316316; note MT305876 is misnamed in GenBank as “Hyphopichia lachancei” but comes from CBS 16335, the type strain of C. sylvatica [1]).
DNA was isolated by phenol/chloroform extraction from liquid YPD cultures grown at 20°C. Short-read library construction (Illumina DNA-Prep(M) Ref:20060060) and sequencing (UCD Conway Core Facility, Dublin, Ireland) used a NextSeq2000 and P1 flowcell, yielding 5.7 million read pairs (2 × 150 bp; 127× coverage). Adapters and low-quality reads were removed (Skewer version 0.2.2) (5). For long-read sequencing, after DNA extraction (Biosearch Technology Masterpure yeast DNA purification kit MPY80010), library preparation (Native Barcoding Kit SQK-NBD112-24), end repair (NEB-M6630), and ligation (NEB-E6056), we selected for DNA >3 kb with Long Fragment Buffer before Oxford Nanopore sequencing (MinION MK1C, flowcell FLO-MIN112 R10.4). Default fast basecalling was done on the instrument (MinKNOW version 23.07.12; default demultiplexing; barcode trimming; reads ≥500 bp kept; no quality cutoff; 74× coverage). NanoFilt (version 2.3.0) (6) retained 70,996 reads with quality ≥10 and length ≥1,000 bp (reads N50 = 21,004 bp), which were then assembled using Canu (version 2.2) (7), followed by five rounds of error correction using NextPolish (version 1.4.1) with the Illumina reads (8). Default parameters were used, except where otherwise noted.
The assembly consisted of seven nuclear contigs (total 13.5 Mb; N50 = 2,385,092 bp) and the mitochondrial genome (33,689 bp circular unit; accession number CAXWVZ010000008). Six nuclear contigs terminate with tandem repeats of (GGGTGTCT)n at both ends, so they are inferred to be complete chromosomes with telomeres (9). The seventh nuclear contig (CAXWVZ010000007) has this telomeric repeat at one end and an rDNA array at the other end; this is the only rDNA locus in the genome.
Using BUSCO version 5.1.2 (10), genome completeness was estimated as 97.36% compared to the Ascomycota lineage data set. G + C content is 45.92%.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Brysch-Herzberg M, Dlauchy D, Seidel M, Péter G. 2021. Cyberlindnera sylvatica sp. nov., a yeast species isolated from forest habitats. Int J Syst Evol Microbiol 71. doi:10.1099/ijsem.0.00447733507858 · doi ↗ · pubmed ↗
- 2Bergin SA, Allen S, Hession C, Ó Cinnéide E, Ryan A, Byrne KP, Ó Cróinín T, Wolfe KH, Butler G. 2022. Identification of European isolates of the lager yeast parent Saccharomyces eubayanus . FEMS Yeast Res 22. doi:10.1093/femsyr/foac 053PMC 972644736473696 · doi ↗ · pubmed ↗
- 3Kurtzman CP, Robnett CJ. 1998. Identification and phylogeny of ascomycetous yeasts from analysis of nuclear large subunit (26S) ribosomal DNA partial sequences. Antonie Van Leeuwenhoek 73:331–371. doi:10.1023/a:10017610088179850420 · doi ↗ · pubmed ↗
- 4Leaw SN, Chang HC, Sun HF, Barton R, Bouchara J-P, Chang TC. 2006. Identification of medically important yeast species by sequence analysis of the internal transcribed spacer regions. J Clin Microbiol 44:693–699. doi:10.1128/JCM.44.3.693-699.200616517841 PMC 1393093 · doi ↗ · pubmed ↗
- 5Jiang H, Lei R, Ding SW, Zhu S. 2014. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics 15:182. doi:10.1186/1471-2105-15-18224925680 PMC 4074385 · doi ↗ · pubmed ↗
- 6De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. Nano Pack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi:10.1093/bioinformatics/bty 14929547981 PMC 6061794 · doi ↗ · pubmed ↗
- 7Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k -mer weighting and repeat separation . Genome Res 27:722–736. doi:10.1101/gr.215087.11628298431 PMC 5411767 · doi ↗ · pubmed ↗
- 8Chen Z, Erickson DL, Meng J. 2021. Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses. Genomics 113:1366–1377. doi:10.1016/j.ygeno.2021.03.01833716184 · doi ↗ · pubmed ↗
