The complete chloroplast genome and phylogenetic analysis of Jasminum lanceolaria Roxb. (Oleaceae)
Xianglan Liang, Guangfu Tian, Song Guo, Chang Liu

TL;DR
This paper presents the full chloroplast genome of Jasminum lanceolaria, a medicinal plant, and shows its close relation to another jasmine species.
Contribution
The study provides the first complete chloroplast genome sequence of Jasminum lanceolaria and its phylogenetic placement.
Findings
The chloroplast genome is 163,015 bp long with 133 genes, including 87 protein-coding genes.
Phylogenetic analysis shows J. lanceolaria is closely related to Jasminum polyanthum.
The genome data offer a reference for taxonomic identification and future research on this species.
Abstract
Jasminum lanceolaria Roxb. (1996) is an important medicinal herb with diverse applications. It is native to China and belongs to the family Oleaceae. In this study, we reported the complete chloroplast genome sequence of J. lanceolaria. The assembled genome has a total length of 163,015 bp, with an overall GC content of 38.87%, and contained 133 genes, including 87 protein-coding genes, 38 tRNA genes, and eight rRNA genes. Phylogenetic analysis revealed that J. lanceolaria was closely related to Jasminum polyanthum. These findings provided valuable genomic resources and a reference for the taxonomic identification of J. lanceolaria.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3- —‘International Training Course on Development of Molecular Markers for Precision Identification of Germplasm Resource’ from Ministry of Sciences and Technology of China
- —Guangxi Laibin City Science Research and Technology Development Plan
- —National Science & Technology Fundamental Resources Investigation Program of China
- —Hunan Provincial Special Project for Construction of Innovation Demonstration Area at Chenzhou City under National Sustainable Development Plan
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Biological and pharmacological studies of plants · Phytochemistry and Biological Activities
Introduction
Jasminum lanceolaria Roxb. (1820) is a species in the Oleaceae family (Quang et al. 2013), commonly known as the olive family, which comprised a diverse range of flowering plants, including economically important species such as olives (Olea europaea) and jasmine (Jasminum). The genus Jasminum consisted of shrubs and vines, and J. lanceolaria was characterized by its lance-shaped leaves and fragrant flowers (Rescigno et al. 2025). This climbing shrub typically occurs in low-altitude thickets from southeastern China to India (Balkrishna et al. 2021). Its stems and roots had traditionally been used to treat rheumatism and fever (Yan et al. 2015), while the leaves served as an anti-inflammatory agent (Sun et al. 2008).
The genus Jasminum contained a diverse array of bioactive compounds, including flavonoids (Sun et al. 2007), lignans, phenolic compounds, and glycosides (Jia-Ming et al. 2009), which exhibited antioxidant, anti-inflammatory, and antitumor activities (Tan et al. 2024). Despite the economic, medicinal, and ornamental importance of J. lanceolaria, its evolutionary relationships to other cogenera species in the Jasminum genus remained poorly understood. Chloroplasts are universal organelles found in plants. They have their own genomes. The relatively conserved structure, uniparental inheritance, and moderate mutation rates make the chloroplst genomes valuable molecular markers for phylogenetic reconstruction and species identification (Wang et al. 2021).
In the present study, we aimed to sequence and annotate the complete chloroplast genome of J. lanceolaria, thereby providing valuable data for phylogenetic analyses and lay the foundation for accurate identification of J. lanceolaria and products.
Materials and methods
Fresh and healthy leaves of J. lanceolaria were collected from South China Botanical Garden, Guangdong Province, China (23°10′46.6″N, 113°21′6.7″E). Pictures of the plant are shown in Figure 1. The plant specimen was identified by GuangFu Tian ([email protected]). The voucher specimen was deposited in the herbarium at the Institute of Medicinal Plant Development under the voucher number JXHC54 (Figure 1).
The Pictures of the individual plant of J. lanceolaria, photographed by guangfu Tian ([email protected]) in South China Botanical Garden, Guangdong, China. (A) Growth form of J. lanceolaria, showing the scandent shrub with glabrous or pubescent stems, terete branchlets, glabrous petioles, opposite leaves, and a terminal leaflet with distinct petiolule. (B) Leaf morphology of J. lanceolaria, showing 3-foliolate leaves with elliptic, ovate, or lanceolate leaflets.
Total genomic DNA was isolated from J. lanceolaria leaf tissues using the Plant Genomic DNA Kit (Tiangen, Beijing, China) following the manufacturer’s protocol. DNA purity was assessed using a NanoDrop spectrophotometer, and DNA concentration was quantified using a Qubit fluorometer. Finally, the DNA integrity was evaluated by agarose gel electrophoresis.
The DNAs were subjected to library construction following the recommendation of the manufacturers (Illumina, USA). Sequencing was performed on a Illumina Xplus platform, generating paired-end reads of 150 bp with a total raw data volume of approximately 10 GB. The raw sequencing reads were processed for quality control by using FastQC for the evaluation overall sequencing quality. And Fastp v0.2 (Chen 2023) was used to remove adapter sequences and low-quality reads. The resulting clean data were then used for the de novo assembly of the complete chloroplast genome by using GetOrganelle v1.7.7.1 (Jin et al. 2020) with default parameters. The resulting circular genome was visualized by using Bandage v0.8.1 (Wick et al. 2015). To evaluate genome coverage, we mapped the sequencing reads back to the assembled chloroplast genome using BWA v0.7.17 (Li 2013), and calculated the coverage depth using Samtools v1.13 (Danecek et al. 2021). Genome annotation was performed by using CPGAVAS2 (Shi et al. 2019), and the annotation of tRNA genes was further verified with tRNAscan-SE v2.0 (Chan et al. 2021). To ensure the accuracy of the annotation, we manually corrected the start and stop codons as well as intron/exon boundaries using Geneious Prime 2025, with the chloroplast genome sequences of Jasminum fluminense (NC_042272.1) and J. tortuosum (NC_034691.1) serving as the reference sequences. The chloroplast genome map, along with the cis-splicing genes and the trans-splicing gene (rps12), was generated by using CPGView (Liu et al. 2023).
For phylogenetic analysis, chloroplast genome sequences from 29 additional species within Oleaceae were retrieved from the GenBank. Two species, Callicarpa longissima (NC_088747.1) and Callicarpa macrophylla (NC_058323.1), were used as outgroup species. Multiple sequence alignments of the complete chloroplast genome sequences were generated using MAFFT v7.490 (Katoh and Standley 2013). We used Modelfinder to determine the best model (https://www.nature.com/articles/nmeth.4285), which was found to be TVM+F + R5. Maximum likelihood (ML) trees were constructed by using IQ-TREE v2.3.3 (Lanfear et al. 2020), under the best-fit model, and node support was assessed with 1000 bootstrap replicates. The resulting phylogenetic tree was visualized by using the Interactive Tree of Life (iTOL) online tool (Letunic and Bork 2024) (https://itol.embl.de/). The complete chloroplast genome sequence of J. lanceolaria was submitted to the GenBank database and was assigned the accession number (PX412911).
Results
Visualization of the assembled chloroplast genome of J. lanceolaria using Bandage indicated a complete and circular structure. The average sequencing depth was 4951.13×, with a minimum coverage of 1313×. The absence of uncovered regions indicated a high degree of assembly completeness and reliability (Figure S1).
The chloroplast genome of J. lanceolaria was 163,015 bp in length and has an overall GC content of 38.87%. It exhibits the typical quadripartite structure, consisting of a large single-copy (LSC) region of 90,557 bp, a small single-copy (SSC) region of 13,088 bp, and a pair of inverted repeat (IR) regions, each 29,685 bp in length.
The GC content varied across different regions: 35.75% in the LSC, 32.36% in the SSC, and 41.46% in the IRs. A total of 133 genes were annotated in the chloroplast genome, including 87 protein-coding genes, 38 transfer RNA (tRNA) genes, and 8 ribosomal RNA (rRNA) genes (Figure 2). Among the protein-coding genes, 10 were identified as cis-splicing genes, each containing a single intron (Figure S2). Additionally, ycf3 gene contained two introns. The rps12 gene was characterized as a trans-splicing gene (Figure S3), similar to those found in the chloroplast genomes of other plant species. The structure of these splicing genes serves as a quality indicator of the annotation. No unusual strucure found for these genes made us to concluded that the genome was annotated correctly.
Schematic map illustrating the overall features of the chloroplast genome of J. lanceolaria. The map comprises six concentric tracks by default. From the center outward: the first track displays dispersed repeats; the second shows long tandem repeats as short blue bars; the third indicates short tandem repeats (microsatellites) as colored bars. The fourth track marks the structural regions of the genome, including SSC, LSC, IRa, and IRb. The fifth track plots the GC content across the genome. The sixth track presents annotated genes, with optional codon usage bias shown in parentheses after gene names. Genes are color-coded based on functional categories, as indicated in the legend at the bottom left. Genes transcribed on the inner and outer circles are oriented clockwise and counterclockwise, respectively.
The phylogenetic analysis revealed that J. lanceolaria was more closely related to species in the genus Chrysojasminum at the intergeneric level, with strong bootstrap support values (Figure 3). At the intrageneric level, J. lanceolaria was most closely related to Jasminum polyanthum (NC_042273.1).
Phylogenetic tree of 29 additional species within oleaceae inferred using the ML method based on complete chloroplast genome sequence. Callicarpa longissima (NC_088747.1) and Callicarpa macrophylla (NC_058323.1) (Liu et al. 2022) was selected as outgroup. Numbers at each node represent the bootstrap values. The position of J. lanceolaria is marked in bold. The sequences used for constructing the phylogenetic tree are as follows: Jasminum multiflorum NC_085217.1 (Xu et al. 2024), Jasminum fluminense NC_042272.1, Jasminum auriculatum NC_085215.1(Xu et al. 2024), Jasminum polyanthum NC_042273.1, Jasminum dichotomum NC_085216.1, Jasminum tortuosum NC_034691.1, Jasminum nervosum NC_085477.1 (Le and Do 2024), Chrysojasminum floridum NC_085219.1, Chrysojasminum odoratissimum NC_085218.1, Olea lancea NC_042278.1, Olea exasperata NC_036985.1, Syringa villosa NC_061378.1, Syringa oblata NC_057990.1 (Yang et al. 2023), Syringa wolfii NC_049090.1 (Liu et al. 2020), Fraxinus velutina NC_082971.1, Fraxinus malacophylla NC_063328.1 (Duan et al. 2020), Fraxinus pennsylvanica NC_043874.1 (Yi et al. 2019), Myxopyrum hainanense NC_047485.1 (Zhu et al. 2020), Forsythia mira NC_046065.1 (Gao et al. 2019), Forsythia suspensa NC_036367.1 (Wang et al. 2017), Chionanthus retusus NC_035000.1 (He et al. 2017), Chionanthus virginicus NC_042447.1 (Wadl et al. 2022), Chionanthus macrobotrys NC_042384.1, Osmanthus cooperi NC_053565.1 (Wang et al. 2019), Osmanthus didymopetalus NC_057502.1 (Zhao et al. 2020), Osmanthus armatus NC_065990.1 (Du et al. 2019), Ligustrum quihoui NC_057246.1 (Wang et al. 2019), Ligustrum obtusifolium NC_071249.1 (Long et al. 2023), Ligustrum sinense NC_071248.1 (Long et al. 2023).
Discussion and conclusion
Here, the chloroplast genome of J. lanceolaria was assembled and annotated for the first time, with a length of 163,015 bp and a total of 133 genes. This genome was comparable in size (159,404-165,352 bp) (Xu et al. 2024) to other Jasminum species, exhibiting a structure and gene composition that reflected the evolutionary conservation of plastid genomes within the genus.
Phylogenomic analyses based on complete chloroplast genomes yielded results that were largely consistent with previous phylogenies (Le and Do 2024; Xu et al. 2024). At the intergeneric level, J. lanceolaria was closely related to members of the genus Chrysojasminum, whereas within the Jasminum genus, it was clustered most closely with J. polyanthum. According to the most recent classification system of Oleaceae (Wallander and Albert 2000; Green 2004; The Angiosperm Phylogeny Group 2016), species within each genus formed a well-supported monophyletic clade, and the phylogenetic tree inferred from complete chloroplast genome sequences effectively distinguished species and reliably resolved their phylogenetic relationships.
To further refine the phylogenetic framework within Jasminum, future studies integrating nuclear genomic data, such as transcriptomes or low-copy nuclear genes, and broader taxon sampling will be valuable. Overall, the complete chloroplast genome of J. lanceolaria provided a valuable genomic resource for phylogenetic studies and offers important data for species identification.
Supplementary Material
Supplemental Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Balkrishna A et al. 2021. Mechanistic insight into antimicrobial and antioxidant potential of Jasminum species: a herbal approach for disease management. Plants. 10(6):1089. 10.3390/plants 1006108934071621 PMC 8227019 · doi ↗ · pubmed ↗
- 2Chan PP, Lin BY, Mak AJ, Lowe TM. 2021. t RN Ascan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. 49(16):9077–9096. 10.1093/nar/gkab 68834417604 PMC 8450103 · doi ↗ · pubmed ↗
- 3Chen SF. 2023. Ultrafast one‐pass FASTQ data preprocessing, quality control, and deduplication using fastp. Imeta. 2(2):e 107. 10.1002/imt 2.10738868435 PMC 10989850 · doi ↗ · pubmed ↗
- 4Danecek P et al. 2021. Twelve years of SA Mtools and BC Ftools. Gigascience. 10(2):giab 008. 10.1093/gigascience/giab 00833590861 PMC 7931819 · doi ↗ · pubmed ↗
- 5Du Y, Li R, Zhang Q. 2019. Complete plastome sequence of Osmanthus armatus and phylogenetic implications in Oleaceae. Mitochondrial DNA B Resour. 4(2):3083–3084. 10.1080/23802359.2019.166668633365865 PMC 7706894 · doi ↗ · pubmed ↗
- 6Duan HC et al. 2020. The complete chloroplast genome of Fraxinus malacophylla (Oleaceae, Oleoideae). Mitochondrial DNA B Resour. 5(3):3588–3589. 10.1080/23802359.2020.182799933367021 PMC 7594738 · doi ↗ · pubmed ↗
- 7Gao S et al. 2019. The complete chloroplast genome of Forsythia mira, an endemic medicinal shrub in China. Mitochondrial DNA B Resour. 5(1):57–58. 10.1080/23802359.2019.169392433366420 PMC 7720978 · doi ↗ · pubmed ↗
- 8Green P. 2004. Oleaceae. In: Flowering plants dicotyledons: Lamiales (except Acanthaceae including Avicenniaceae). Springer; p 296–306. 10.1007/978-3-642-18617-2 · doi ↗
