Draft genome assembly and annotation of Haematococcus lacustris strain Liv1, an industrial astaxanthin-producing microalga
Michael J. Lashbrook, Jeremiah A. Adesanya, Jayson Talag, Kathleen Lail, Raymond Lee, Anna M. Lipzen, Jie Guo, Jerry Jenkins, Alan Kuo, Christa Pennacchio, Igor V. Grigoriev, Rhona K. Stuart, Christopher S. Ward

TL;DR
This paper presents the genome of Haematococcus lacustris, a microalga used for producing natural astaxanthin, to aid future research on carotenoid production.
Contribution
The study provides the first annotated genome for H. lacustris strain Liv1.
Findings
The genome size is 291.5 Mbp.
The genome is annotated to support research in carotenoid biosynthesis and crop protection.
Abstract
Haematococcus lacustris is a ubiquitous unicellular green alga with industrial bioproduct applications, namely, as feedstock for natural astaxanthin. We report the annotated 291.5 Mbp genome for H. lacustris Liv1 to support future algal research in the areas of carotenoid biosynthesis and crop protection.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Characteristics | |
|---|---|
| Genome assembly size (Mbp) | 291.5 |
| Sequencing read coverage depth | 297× |
| No. of scaffolds | 1,648 |
| Scaffold N50 (kb) | 360 |
| Scaffold L50 | 222 |
| No. of gaps | 0 |
| GC content (%) | 60.5 |
| No. of genes | 21,755 |
| No. of protein-coding genes | 21,367 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgal biology and biofuel production · Marine and coastal ecosystems · Microbial Community Ecology and Physiology
ANNOUNCEMENT
Haematococcus lacustris (class Chlorophyceae, order Chlamydomonadales, family Haematococcaceae) is a microalgal species that is cultivated industrially for astaxanthin, a carotenoid with strong antioxidant properties produced under various abiotic stress conditions (1). Genomic insights into improving astaxanthin production are limited by the availability of only two sequenced strains (i.e., H. lacustris NIES-144 and H. lacustris SAG192.80; [2–4]). As demand for astaxanthin increases, having genome resources that reflect the functional and ploidy diversity of the Haematococcus genus will become increasingly valuable.
The strain H. lacustris Liv1 was selected for industrial cultivation due to its high biovolume and astaxanthin content per cell. For genome sequencing, the culture was obtained from Lawrence Livermore National Lab (USA) and maintained in Bristol’s modified medium (5) under 14/10 h light/dark photocycles (80 μE m^−2^ s^−1^) at 25°C. To create a pooled transcript library, sub-cultures were transferred to fresh Bristol’s media, heterotrophic media (Tris-acetate-phosphate, [6]), and reddening media (7) incubated in normal light, as well as Bristol’s media in dark (0 μE m^−2^ s^−1^). Nucleic acid extraction using Quick-DNA/RNA Miniprep Kit (Zymo) was performed at the Arizona Genomics Institute.
Genomic DNA was sequenced at the Joint Genome Institute (JGI) using PacBio and Illumina sequencing, following similar protocols to (8). For PacBio, 10 µg of genomic DNA was sheared to 30–50 kb using Megaruptor 3 (Diagenode). Sheared DNA was treated with exonuclease, DNA damage repair enzyme mix, and end-repair/A-tailing mix; ligated with overhang adapters using SMRTbell Express Template Prep Kit 2.0 (PacBio); and purified with AMPure PB Beads (PacBio). Individual libraries were size-selected using 0.75% agarose gel cassettes with Marker S1 and High Pass protocol on the BluePippin (Sage Science). PacBio Sequencing primers were then annealed to the SMRTbell template library, and sequencing polymerase was bound using Sequel II Binding kit 1.0. The prepared SMRTbell template libraries were then sequenced on a Pacific Biosciences Sequel II sequencer using 8M v.1 SMRT cells and v.1.0 sequencing chemistry with 1 × 900 sequencing movie run times. For Illumina, 100 ng of genomic DNA was sheared to 500 bp using the Covaris LE220 and size-selected with SPRI using TotalPure NGS beads (Omega Bio-tek). Fragments were treated with end repair, A-tailing, and ligation of Illumina-compatible adapters (IDT) using the KAPA-HyperPrep kit (KAPA Biosystems). For transcriptome sequencing, stranded cDNA libraries were generated using the Illumina Truseq Stranded RNA LT kit. mRNA was purified from 1 µg of total RNA using magnetic beads containing poly-T oligos. mRNA was fragmented and reverse transcribed using random hexamers and SSII (Invitrogen) followed by second-strand synthesis. The fragmented cDNA was treated with end pair, A-tailing, adapter ligation, and eight cycles of PCR. The prepared Illumina libraries were quantified using KAPA Biosystems’ Next-Generation Sequencing Library qPCR Kit (Roche) and run on a Roche LightCycler 480 Real-Time PCR instrument. The quantified library was then multiplexed with other libraries, and the pool of libraries was then prepared for sequencing on the Illumina NovaSeq 6000 sequencing platform (2 × 150 indexed run) using NovaSeq XP v.1 reagent kits and an S4 flow cell.
Genome assembly involved reads being assembled with MECAT2 (9) and polished with ARROW v.2.0 (Pacific Biosciences). Homozygous single-nucleotide polymorphisms and insertion–deletions were corrected in the raw assembly consensus using ~53× of Illumina reads. To facilitate gene annotation, the transcriptome (430 million paired-end reads) consisted of two parts: (i) Illumina reads were assembled with Trinity v.2.11; and (ii) PacBio reads were clustered into Consensi with IsoSeq v.9.0. Default parameters were used except where otherwise noted. Functional annotation using the JGI annotation (10) pipeline resulted in 21,367 protein-coding gene models (Table 1). The average estimated protein length is 422 amino acids, with 7.14 exons per gene. Genome assembly size and gene count are comparable to other published Haematococcus genomes (https://phycocosm.jgi.doe.gov/chlorophyceae). This genome will support continued optimization of microalgal bioproduction and contribute foundational data for genome-informed strain selection and development.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Oslan SNH, Oslan SN, Mohamad R, Tan JS, Yusoff AH, Matanjun P, Mokhtar RAM, Shapawi R, Huda N. 2022. Bioprocess strategy of Haematococcus lacustris for biomass and astaxanthin production keys to commercialization: perspective and future direction. Fermentation 8:179. doi:10.3390/fermentation 8040179 · doi ↗
- 2Morimoto D, Yoshida T, Sawayama S. 2020. Draft genome sequence of the astaxanthin-producing microalga Haematococcus lacustris strain NIES-144. Microbiol Resour Announc 9:e 00128-20. doi:10.1128/MRA.00128-2032499361 PMC 7272542 · doi ↗ · pubmed ↗
- 3Luo Q, Bian C, Tao M, Huang Y, Zheng Y, Lv Y, Li J, Wang C, You X, Jia B, Xu J, Li J, Li Z, Shi Q, Hu Z. 2019. Genome and transcriptome sequencing of the astaxanthin-producing green microalga, Haematococcus pluvialis. Genome Biol Evol 11:166–173. doi:10.1093/gbe/evy 26330496415 PMC 6330051 · doi ↗ · pubmed ↗
- 4Bian C, Liu C, Zhang G, Tao M, Huang D, Wang C, Lou S, Li H, Shi Q, Hu Z. 2023. A chromosome-level genome assembly for the astaxanthin-producing microalga Haematococcus pluvialis. Sci Data 10:511. doi:10.1038/s 41597-023-02427-137537173 PMC 10400597 · doi ↗ · pubmed ↗
- 5Bold HC. 1949. The morphology of Chlamydomonas chlamydogama, sp. nov. Bulletin of the Torrey Botanical Club 76:101. doi:10.2307/2482218 · doi ↗
- 6Harris EH. 1989. The Chlamydomonas sourcebook: a comprehensive guide to biology and laboratory use. Academic Press, San Diego.10.1126/science.246.4936.1503-a 17756009 · doi ↗ · pubmed ↗
- 7Sun H, Liu B, Lu X, Cheng KW, Chen F. 2017. Staged cultivation enhances biomass accumulation in the green growth phase of Haematococcus pluvialis. Bioresour Technol 233:326–331. doi:10.1016/j.biortech.2017.03.01128285225 · doi ↗ · pubmed ↗
- 8Calhoun S, Kamel B, Bell TAS, Kruse CPS, Riley R, Singan V, Kunde Y, Gleasner CD, Chovatia M, Sandor L, Daum C, Treen D, Bowen BP, Louie KB, Northen TR, Starkenburg SR, Grigoriev IV. 2022. Multi-omics profiling of the cold tolerant Monoraphidium minutum 26B-AM in response to abiotic stress. Algal Res 66:102794. doi:10.1016/j.algal.2022.102794 · doi ↗
