The complete chloroplast genome sequence of Bolbitis angustipinna (Hayata) H. Ito (Dryopteridaceae)
Lin Luo, Sai Wang, Huajing Zhou, Yuhan Fang, Tuantuan Wang, Huilong Ou, Jia Xie, Faguo Wang

TL;DR
This paper reports the complete chloroplast genome of a rare fern species and compares it with other ferns to understand its evolutionary relationships.
Contribution
The study provides the first complete chloroplast genome sequence of Bolbitis angustipinna and its phylogenetic placement among ferns.
Findings
The chloroplast genome is 152,562 base pairs long with specific region sizes and gene counts.
Phylogenetic analysis shows B. angustipinna is closely related to B. deltigera.
The genome includes 115 genes, offering insights into fern evolution.
Abstract
Bolbitis angustipinna (Hayata) H. Ito is a rare ornamental fern primarily distributed throughout East Asia and Southeast Asia. Its complete chloroplast genome was assembled, and a phylogenetic tree was constructed using 15 fern species. The genome is 152,562 base pairs, with a large single-copy region (82,712 bp), a small single-copy region (21,403 bp), and two inverted repeat regions (24,208 bp). It encodes 115 genes, including 72 protein-coding genes, 35 tRNA genes, and 8 rRNA genes. Phylogenetic analysis shows that B. angustipinna is most closely related to B. deltigera, providing valuable genetic and evolutionary insights.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3- —National Natural Science Foundation of China10.13039/501100001809
- —National Key Research & Development Program of China
- —Special Fund Project for Environmental Protection of Hainan Province – Investigation and Assessment of Biodiversity in Baisha Li Autonomous County
- —Hainan Provincial Research & Development Program
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFern and Epiphyte Biology · Genomics and Phylogenetic Studies · Biological and pharmacological studies of plants
Introduction
The Dryopteridaceae is a prominent fern family, with species that play critical roles in ecology, phylogeny, and conservation biology (Zhang et al. 2013; Liu et al. 2016). The genus Bolbitis, which belongs to this family, comprises approximately 80 species (Moran et al. 2010; PPGI 2016). Bolbitis angustipinna (Hayata) H. Ito (1938), a representative species of this genus, typically grows along streams in dense forests, and on rocks in tropical and subtropical areas (Dong and Zhang. 2005). Known for its abundant lateral pinnae (15–20 pairs), B. angustipinna forms an aesthetically pleasing plant shape, making it a popular choice for landscape design and garden decoration.
In addition to its ornamental value, studies on the embryonic development of B. angustipinna have highlighted the significance of its sexual generation, with observation of morphological features such as genitals and sperm providing valuable insights into the phylogenetic relationships within Bolbitis (Zhao et al. 2014). Despite this, there have been few reports on the chloroplast genome of B. angustipinna, and such genomic data could offer critical insights into its classification, evolutionary relationships, and adaptability.
Notably, B. angustipinna is listed as a vulnerable (VU) species in the ‘China Biodiversity Red List-Higher Plants Volume’ (Ministry of Environmental Protection of the People's Republic of China 2013), and is highly sensitive to the environment, which underscores the importance of its conservation and the need for further research. This study aims to sequence, assemble, and analyze the complete chloroplast genome of B. angustipinna, focusing on its genome structure, the characteristics of its chloroplast genome’s coding and non-coding regions, and its potential implications for plant phylogeny and horticultural applications. The results also contribute valuable reference material for the conservation of this species and other biodiversity efforts.
Materials and methods
Fresh leaves of B. angustipinna were collected from Limu Mountain in Qiongzhong County, Hainan Province (109.70674281°N, 18.70850097°E; altitude: 670 m). The plant specimens are stored in the herbarium of the South China Botanical Garden (http://herbarium.scbg.cas.cn/, Feiyan Zeng, [email protected]), with specimen number WFG5608-1 (Figure 1(A,B)). Fresh plant samples were stored in liquid nitrogen for preservation. Genomic DNA was extracted from the leaf tissues using the CTAB method (Doyle 1987) and sequenced on the Illumina HiSeq X-Ten platform (Illumina Inc., San Diego, CA).
Species reference image of B. angustipinna. (A) Plant shape of B. angustipinna. (B) Morphological characteristics offertile fronds of B. angustipinna. The plant’s sterile fronds are elliptic and pinnate. The terminal pinna is linear-lanceolate with a subterminal bulbil. The lateral pinnae are alternate and linear-lanceolate. Sporangia inserted usually throughout abaxial surface, arrangement fully acrostichoid, sometimes with a sterile strip along costa. The species photo was taken by Faguo Wang in Limu Mountain in Qiongzhong County, Hainan Province, without any copyright issues.
The resulting raw sequences were assembled using the GetOrganelle 1.7.5 toolkit, which relies on SPAdes 3.11 and Bowtie2 for sequence assembly and alignment (Langmead and Salzberg 2012; Bankevich et al. 2012; Jin et al. 2020). The genome coverage depth was determined using SAMtools v1.21 (Li et al. 2009), while the sequencing depth and coverage map were generated using ggplot2 (Ito and Murphy 2013) in R (Figure S1), which provided a view of the sequencing efficiency. The complete chloroplast genome was annotated and visualized using the CpGAVAS2 online tool (Shi et al. 2019) and CPGView (Shi et al. 2019), respectively, and the annotation was manually verified and refined in Geneious 9.0.2. The final annotated chloroplast genome was used to generate a circular diagram via Chloroplot (Zheng et al. 2020). The complete chloroplast genome sequence has been submitted to GenBank under accession number PP471972.
To determine the phylogenetic position of B. angustipinna, we first conducted a BLAST search of its chloroplast genome against the NCBI nucleotide database (Altschul et al. 1990) to gather related sequences for constructing a phylogenetic tree (Figure 2). Chloroplast DNA sequences were aligned using MAFFT v.7 (Katoh and Standley 2013). These alignments were concatenated and analyzed using maximum-likelihood (ML) and Bayesian inference (BI) methods in RAxML v.7 (Stamatakis 2014) and MrBayes v3.2.1 (Ronquist et al. 2012), respectively, to generate the phylogenetic tree.
Maximum-likelihood and Bayesian phylogeny of B. angustipinna and related taxa based on 15 complete chloroplast genomes, with Tectaria decurrens from the Aspidiaceae family as the outgroup. Bootstrap values are indicated at each node. The target genome of Bolbitis angustipinna is shown in red. The following sequences were used: Bolbitis sinensis (NC071924), Bolbitis x multipinna (NC064057), Bolbitis aliena (NC071960), Bolbitis tonkinensis (NC071958), Bolbitis laxireticulata (NC061668), Bolbitis subcordata (MN623358) (Liu et al. 2020), Bolbitis hekouensis (NC071934), Bolbitis deltigera (NC071959), Elaphoglossum marginatum (MZ727565), Elaphoglossum yoshinagae (NC070104), Dryopteris podophylla (NC084317), Dryopteris championii (NC084316), Dryopteris decipiens (NC035854), and Tectaria decurrens (MN623363) (Liu et al. 2020).
Results
The raw sequencing data for B. angustipinna amounted to about 6 Gb. The complete chloroplast genome is 152,562 base pairs (bp) in length, and the assembled genome achieved an average sequencing depth of 339.45×, with a minimum depth of 10× and a maximum depth of 609× (Figure S1). The genome comprises a large single-copy (LSC) region of 82,712 bp, a small single-copy (SSC) region of 21,434 bp, and a pair of inverted repeat regions (IRa and IRb) of 24,208 bp. The total GC content of the chloroplast genome is 42%, with the highest GC content observed in the IR regions (45.2%), followed by the LSC region (41.1%), and the lowest GC content in the SSC region (38.4%).
The chloroplast genome encodes 115 genes, including 72 protein-coding genes, 35 tRNA genes, and 8 rRNA genes. Among these, 12 genes are duplicated, including 4 rRNA-encoding genes (rrn4.5S, rrn5S, rrn16S, rrn23S), 5 tRNA-encoding genes (trnA-UGC, trnI-GAU, trnH-GUG, trnR-ACG, trnN-GUU), and 3 protein-coding genes (rps12, psaI, psbA) (Figure 3). Furthermore, 6 cis-spliced genes (rps16, atpF, clpP, petB, rpl16, ndhA) and the trans-spliced gene rps12 were identified in the chloroplast genome (Figures S2 and S3).
Gene map of chloroplast genome of B. angustipinna (PP471972). In the outermost circle, genes are represented by colored blocks of varying sizes. The intermediate circle illustrates the variation in GC content across different genomic regions. Arrows positioned on the inner and outer sides of this circle indicate genes transcribed in clockwise and counterclockwise directions, respectively. The innermost circle highlights the quadripartite structure – comprising the large single-copy (LSC), small single-copy (SSC), and inverted repeat regions (IRa and IRb) – using distinct colors to demarcate their respective boundaries, with lengths indicated.
Phylogenetic analysis revealed that B. angustipinna is most closely related to B. deltigera, with strong support [ML bootstrap support (MLBS): 100%, posterior probability (PP): 1]. Moreover, the analysis identified Bolbitis and Elaphoglossum as sister taxa (MLBS: 100%, PP: 1), which is consistent with previous studies (Moran et al. 2010).
Conclusions and discussion
In this study, we present the complete chloroplast genome of Bolbitis angustipinna, with a total length of 152,562 bp. The genome consists of an LSC, an SSC, and two IRa and IRb regions. Functional annotation revealed 115 genes, including 72 protein-coding genes, 35 tRNA genes, and 8 rRNA genes, with 12 duplicated genes. This genome shows a gene content and structure similar to that of related species.
The phylogenetic analysis of B. angustipinna revealed its closest relationship to B. deltigera, with strong support from both maximum-likelihood bootstrap values (MLBS: 100%) and posterior probability (PP: 1). This supports the notion that B. angustipinna and B. deltigera are closely related within the phylogenetic tree. Additionally, our analysis identified Bolbitis and Elaphoglossum as sister taxa, a relationship that is robustly supported (MLBS: 100%, PP: 1), aligning with previous studies in the field (Moran et al. 2010). The study provides evidence for the systematics of B. angustipinna by constructing phylogenetic trees and analyzing their classification positions. Furthermore, the embryonic development pattern of B. angustipinna is regarded as a hallmark feature in fern taxonomy and phylogenetic research, which aids in elucidating the evolutionary pathways of related groups within the families Bolbitidaceae and Dryopteridaceae.
In summary, we have constructed the complete chloroplast genome sequence and phylogenetic tree of Bolbitis angustipinna, revealing its genetic characteristics and providing further insights into the phylogenetic relationships among populations of this plant species. This work offers data support for future fern species conservation efforts.
Supplementary Material
Supplemental Material.pdf
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215(3):403–410. 10.1016/S 0022-2836(05)80360-22231712 · doi ↗ · pubmed ↗
- 2Bankevich A et al. 2012. SP Ades: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477. 10.1089/cmb.2012.002122506599 PMC 3342519 · doi ↗ · pubmed ↗
- 3Doyle JJ. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15.
- 4Ito K, Murphy D. 2013. Application of ggplot 2 to pharmacometric graphics. CPT Pharmacometr Syst Pharmacol. 2(10):e 79–e 16. 10.1038/psp.2013.56PMC 381737624132163 · doi ↗ · pubmed ↗
- 5Jin JJ et al. 2020. Get Organelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. 10.1186/s 13059-020-02154-532912315 PMC 7488116 · doi ↗ · pubmed ↗
- 6Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780. 10.1093/molbev/mst 01023329690 PMC 3603318 · doi ↗ · pubmed ↗
- 7Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods. 9(4):357–359. 10.1038/nmeth.192322388286 PMC 3322381 · doi ↗ · pubmed ↗
- 8Li H et al. 2009. The sequence alignment/map format and SA Mtools. Bioinformatics. 25(16):2078–2079. 10.1093/bioinformatics/btp 35219505943 PMC 2723002 · doi ↗ · pubmed ↗
