Chromosome level genome assembly and annotation of Hanseniaspora mollemarum CBS 18055 strain
Nicole Xanthe Bennis, Harshitha Indudhar Hanji, Marcel van den Broek, Jean-Marc Daran

TL;DR
This paper provides a detailed genome assembly of the Hanseniaspora mollemarum strain, which could help improve its use in wine and cider fermentation.
Contribution
The study provides a chromosome-level genome assembly of the Hanseniaspora mollemarum CBS 18055 strain.
Findings
A chromosome-level genome assembly of Hanseniaspora mollemarum CBS 18055 was generated.
The assembly enhances the potential application of this strain in fermentation processes.
Abstract
Hanseniaspora species gained attention due to the ability of these species to ferment simple sugars and to actively contribute to the development of bouquet aromas in wine and cider fermentations. We present a chromosome-level assembly of an isolate of Hanseniaspora mollemarum that would enhance its potential applications.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Canu (ONT) | UniCycler hybrid | |||
|---|---|---|---|---|
| Assembly size (Mb) | 9.18 ( | |||
| N50 (Mb) | 1.25 ( | |||
| N90 (Kb) | 105.20 ( | |||
| GenBank | Chr | Size (bp) | Size (bp) | |
| contig1 (bp) |
| Chr 8 | 1,960,557 | |
| contig2 (bp) |
| Chr 7 | 1,481,168 | |
| contig3 (bp) |
| Chr 6 | 1,331,897 | |
| contig4 (bp) |
| Chr 5 | 1,256,963 | |
| contig5 (bp) |
| Chr 4 | 1,027,777 | |
| contig6 (bp) |
| Chr 3 | 994,132 | |
| contig7 (bp) |
| Chr 2 | 662,091 | |
| contig8 (bp) |
| Chr 1 | 469,443 | |
| contig9 – circular (bp) |
| mtDNA | 32,902 | |
| Nuclear DNA (bp) | 9,184,028 | |||
| Total (bp) | 9,216,930 |
- —EZK | Topconsortium voor Kennis en Innovatie (TKI)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Fermentation and Sensory Analysis · Microbial Natural Products and Biosynthesis
ANNOUNCEMENT
The Hanseniaspora genus comprises over 20 species including the recently described Hanseniaspora gamundiae (1), Hanseniaspora smithiae (2), Hanseniaspora terricola (3), and Hanseniaspora mollemarum. The latter species was isolated in a soil sample from a Dutch garden (CBS 15034) (4). Subsequently, new strains from Great-Britain and Hungary have been deposited at the Westerdijk Fungal Biodiversity Institute (https://wi.knaw.nl/). However, the sole H. mollemarun genome currently available derives from the CBS 15034 strain. To match the interest of the beverage industry for Hanseniaspora strains, it is critical to gain both physiological and genomic knowledge. Here, we report the genome sequence of a new strain of H. mollemarum.
The H. mollemarum CBS 18055 was isolated from an apple tree (Malus domestica) located in the south of France (43°01’24.7”N, 0°58’30.4”E). The sample was grown at 4°C on YPD medium (10 g L^−1^ yeast extract, 20 g L^−1^ Bacto-peptone, and 20 g L^−1^ glucose) supplemented with 3% ethanol. Single colonies were obtained by re-streaking the culture on a solid YPD medium supplemented with 25 µg mL^−1^chloramphenicol and 50 µg mL^−1^ ampicillin incubated at room temperature. This process was repeated thrice. Final isolates were stored in 30% (vol/vol) glycerol at −80°C.
Sanger sequencing of the ITS and the D1/D2 region of the rDNA of CBS 18055 amplified with primers ITS1/ITS4 (5) and NL1/NL4 (6), respectively, revealed a 99.98 and 100% identity to that of H. mollemarum CBS 15034, respectively.
For genome sequencing, total DNA was extracted using the Genomic-tip 100 /G kit protocol (Qiagen, Hilden, Germany). Shotgun library was prepared using Oxford_Nanopore Technologies' SQK-LSK109 kit (ONT, Oxford, United Kingdom) and sequenced on a MinION MK1B device with R10 flow cell. Raw FAST5 signal files were base-called using GPU Guppy (ONT, version 4.5.4) in high-accuracy mode. After filtering for length (>1 kb), 94,060 reads were obtained, yielding 1.04 Gb (N50: 19,089 bp), representing ~112-fold coverage of a H. mollemarum genome. Canu version 2.0 with settings genomeSize = 12 m, useGrid = 0, and nanopore-raw was used for de novo assembly yielding eight contigs (7). To correct errors in the assembly, a 150 bp read length TruSeq PCR-Free Illumina library (Illumina, San Diego, CA) with a 350 bp insert-size was sequenced on a NovaSeq 6000 (Illumina) by Novogene (Cambridge, United Kingdom) yielding 18,811,684 reads for a total of 2.82 Gigabases (307-fold coverage). Reads quality was assessed with FastQC v0.11.5. The genome was polished by mapping the untrimmed Illumina reads with Burrows-Wheeler Aligner (BWA version 0.7.15-r1142-dirty; default parameters used) (8, 9) to the assembly and further processed with SAMtools (version 1.3.1) (10, 11) and polished once by Pilon (version 1.18) with settings fix all (12).
This yielded a genome assembly of 9.18 Mb (N50 = 1.25 Mb) with a GC content of 34.96%, comprising eight contigs. Contig 1–7 (CHR2-to-8) showed telomeric sequences at both ends, whereas contig 8 (CHR1) had telomere sequences on one chromosome arm indicating that this contig also represented (part of) a linear chromosome (Table 1). Due to Canu’s being notoriously known to have difficulty in assembling circular contigs, the fastq files were assembled using UniCycler Hybrid assembler (13). A circular contig of 32.9 kbp was identified as mitochondrial chromosome (Table 1). Annotation of the polished assembly was performed using Funannotate v1.8.15 ((https://github.com/nextgenusfs/funannotate) (14) using ab initio gene predictors SNAP (v2013_11_29), Augustus (3.5.0), Genemark.HMM (ES Suite v4), and glimmerhmm (v3.0.4) in combination with default Funannotate databases, for example, pfam, interpro, andUniprot as previously described in (15).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Čadež N, Bellora N, Ulloa R, Hittinger CT, Libkind D. 2019. Genomic content of a novel yeast species Hanseniaspora gamundiae sp. nov. from fungal stromata (Cyttaria) associated with a unique fermented beverage in Andean Patagonia, Argentina. P Lo S ONE 14:e 0210792. doi:10.1371/journal.pone.021079230699175 PMC 6353571 · doi ↗ · pubmed ↗
- 2Čadež N, Bellora N, Ulloa R, Tome M, Petković H, Groenewald M, Hittinger CT, Libkind D. 2021. Hanseniaspora smithiae sp. nov., a novel apiculate yeast species from patagonian forests that lacks the typical genomic domestication signatures for fermentative environments. Front Microbiol 12:679894. doi:10.3389/fmicb.2021.67989434367085 PMC 8334367 · doi ↗ · pubmed ↗
- 3Liu Z, Wang M-M, Wang G-S, Li A-H, Wang Q-M. 2019. Hanseniaspora terricola sp. nov., an ascomycetous yeast isolated from Tibet. Int J Syst Evol Microbiol 71. doi:10.1099/ijsem.0.00474133720007 · doi ↗ · pubmed ↗
- 4Groenewald M, Lombard L, de Vries M, Lopez AG, Smith M, Crous PW. 2018. Diversity of yeast species from Dutch garden soil and the description of six novel Ascomycetes. FEMS Yeast Res 18. doi:10.1093/femsyr/foy 07630016423 · doi ↗ · pubmed ↗
- 5White TJ, Bruns T, Lee S, Taylor J. 1990. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics, p 315–322. In Innis MA, Gelfand DH, Sninsky JJ, White TJ (ed), PCR Protocols. Academic Press, San Diego.
- 6O’Donnell K. 1993. Fusarium and its near relatives, p 225–233. In Reynolds DR, Taylor JW (ed), The fungal holomarph: mititic, meiotic and pleomorpic speciation in fungal systematic. CAB International, Wallingford.
- 7Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi:10.1101/gr.215087.11628298431 PMC 5411767 · doi ↗ · pubmed ↗
- 8Li H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ar Xiv Genomics. doi:10.48550/ar Xiv.1303.3997 · doi ↗
