Genome sequences of two cyanobacteria strains isolated from hornworts
Warren Shou Leong Ang, Meghan M. Blaszynski, Jun B. Cai, Lindsey Markowitz, Eve A. Maunders, Andreas Norlin, Haley R. Womack, Fay-Wei Li

TL;DR
This paper presents the full genome sequences of two cyanobacteria strains found in hornworts, aiding future research on their symbiotic relationships.
Contribution
The paper provides two new complete genome assemblies of symbiotic cyanobacteria from hornworts.
Findings
Two complete genome assemblies of symbiotic cyanobacteria were sequenced.
The cyanobacteria were isolated from the hornworts Notothylas orbicularis and Phaeoceros carolinianus.
The datasets will support comparative genomic studies of symbiotic cyanobacteria.
Abstract
We report two complete genome assemblies of symbiotic cyanobacteria isolated from the hornwort species Notothylas orbicularis and Phaeoceros carolinianus. These new datasets will facilitate future comparative genomic studies across symbiotic cyanobacteria.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrobial Community Ecology and Physiology · Protist diversity and phylogeny · Legume Nitrogen Fixing Symbiosis
ANNOUNCEMENT
Hornworts have evolved symbiotic relationships with cyanobacteria to gain access to bioavailable nitrogen (1). Studying the interaction between hosts and cyanobionts is of particular interest for agricultural engineering. Despite the identification of many cyanobionts (2), few have their entire genome sequenced (3).
Here, we assembled genomes of two cyanobacteria strains isolated from hornworts—“C110” from Notothylas orbicularis and “C117” from Phaeoceros carolinianus, both were collected from upstate New York (GPS coordinates for C110: 42.451961, –76.454326, C117: 42.343843, –76.269182) (2, 3). Cyanobiont colonies were excised from thalli, sterilized with 5% bleach, and placed on BG11 plates with 6–25 µmol m^−2^ s^−1^ light intensity in 16/8 hour light/dark cycle at 22°C (3, 4). To obtain pure cultures, single colonies were restreaked at least twice. Cyanobacteria were then cultured in liquid BG11 media in the same conditions as before but with constant shaking. Cell density was measured by estimating chlorophyll-a concentration (5). Total chlorophyll-a mass of 250 µg (C110) and 65 µg (C117) were harvested by centrifugation. Genomic DNA was isolated using a Wizard HMW DNA Extraction Kit (Promega) and quantified using the Qubit 1X dsDNA High Sensitivity Kit on a Qubit 4 fluorometer (Invitrogen). A total of 200 ng of DNA was used for library preparation with the Rapid Sequencing Kit V14 (SQK-RAD114, Oxford Nanopore Technologies, UK) without any size selection. DNA fragment shearing was minimized with careful handling and loaded onto a MinION R10.4.1 Flow Cell (Oxford Nanopore Technologies, UK) to sequence for 24 hours.
Unless otherwise stated, default settings were used for all software processing. Raw data were basecalled using Dorado v0.7.3 in super accuracy mode with duplex basecalling (https://github.com/nanoporetech/dorado). Duplex and simplex reads were combined to generate a total of 879,153,448 bases from 160,803 reads (N50: 71,877 bp) and 1,784,282,371 bases from 201,384 reads (N50: 18,452 bp) for C110 and C117, respectively. Reads were filtered based on a minimum length cutoff of 1 k for C110 and 30 k for C117, and a minimum quality cutoff of Q6 for C110 and Q20 for C117. Genomes were assembled using Flye v2.9.2 (6), which circularized contigs when sufficient overlaps were identified.
The assembly from C110 yielded 11 circular contigs and 1 linear contig, totaling 8,366,001 bp (41.42% GC content). The largest circular contig was 7,174,855 bp, while others ranged from 32,854 bp to 331,577 bp. For C117, the assembly yielded four circular and two linear contigs, totaling 8,449,107 bp (39.7% GC content). The largest circular contig was 7,883,191 bp, while others ranged from 86,831 bp to 280,883 bp. Both genomes fall within the expected size range of a typical cyanobacteria. Completeness of both genomes was analyzed using Compleasm v0.2.6 (7) which scored 99.69% and 99.16% for C110 and C117, respectively. Genomes were annotated using National Center for Biotechnology Information (NCBI’s) PGAP v6.8 (https://github.com/ncbi/pgap). The 16S rRNA sequence of C110 showed 99.60% identity with Nostoc sp. ATCC 53789 (GenBank CP046703.1) when BLAST against GenBank’s core_nt database and average nucleotide identity (ANI) analysis using FastANI v1.33 (8) gave a score of 96.98%. The 16S rRNA sequence of C117 showed 97.24% identity with Cylindrospermum sp. NIES-4074 (GenBank AP018269.1), and FastANI scored 79.57%.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1de Vries S, de Vries J. 2022. Evolutionary genomic insights into cyanobacterial symbioses in plants. Quant Plant Biol 3:e 16. doi:10.1017/qpb.2022.337077989 PMC 10095879 · doi ↗ · pubmed ↗
- 2Nelson JM, Hauser DA, Li F-W. 2021. The diversity and community structure of symbiotic cyanobacteria in hornworts inferred from long-read amplicon sequencing. Am J Bot 108:1731–1744. doi:10.1002/ajb 2.172934533221 · doi ↗ · pubmed ↗
- 3Nelson JM, Hauser DA, Gudiño JA, Guadalupe YA, Meeks JC, Salazar Allen N, Villarreal JC, Li F-W. 2019. Complete genomes of symbiotic cyanobacteria clarify the evolution of vanadium-nitrogenase. Genome Biol Evol 11:1959–1964. doi:10.1093/gbe/evz 137PMC 664518031243438 · doi ↗ · pubmed ↗
- 4Stanier RY, Deruelles J, Rippka R, Herdman M, Waterbury JB. 1979. Generic assignments, strain histories and properties of pure cultures of cyanobacteria. Microbiology (Reading, Engl) 111:1–61. doi:10.1099/00221287-111-1-1 · doi ↗
- 5Lichtenthaler HK, Wellburn AR. 1983. Determinations of total carotenoids and chlorophylls a and b of leaf extracts in different solvents. Biochem Soc Trans 11:591–592. doi:10.1042/bst 0110591 · doi ↗
- 6Kolmogorov M, Bickhart DM, Behsaz B, Gurevich A, Rayko M, Shin SB, Kuhn K, Yuan J, Polevikov E, Smith TPL, Pevzner PA. 2020. meta Flye: scalable long-read metagenome assembly using repeat graphs. Nat Methods 17:1103–1110. doi:10.1038/s 41592-020-00971-x 33020656 PMC 10699202 · doi ↗ · pubmed ↗
- 7Huang N, Li H. 2023. compleasm: a faster and more accurate reimplementation of BUSCO. Bioinformatics 39:btad 595. doi:10.1093/bioinformatics/btad 59537758247 PMC 10558035 · doi ↗ · pubmed ↗
- 8Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. 2018. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 9:5114. doi:10.1038/s 41467-018-07641-930504855 PMC 6269478 · doi ↗ · pubmed ↗
