Genetic diversity and population structure of an insect‐pollinated and bird‐dispersed dioecious tree Magnolia kwangsiensis in a fragmented karst forest landscape
Yanfang Lin, Yingying Xiang, Sujian Wei, Qiwei Zhang, Yanhua Liu, Zhiyong Zhang, Shaoqing Tang

TL;DR
This study examines the genetic diversity and dispersal patterns of Magnolia kwangsiensis in fragmented forests to guide its conservation.
Contribution
The study identifies two distinct genetic groups and highlights the role of pollen and seed dispersal in maintaining genetic connectivity.
Findings
High genetic diversity was observed with average heterozygosities of 0.726 and 0.687.
Pollen and seed dispersal help maintain genetic connectivity among populations.
Two genetically differentiated groups correspond to disjunct regions and should be treated as separate conservation units.
Abstract
This study combined population genetics and parentage analysis to obtain foundational data for the conservation of Magnolia kwangsiensis. M. kwangsiensis is a Class I tree species that occurs in two disjunct regions in a biodiversity hotspot in southwest China. We assessed the genetic diversity and structure of this species across its distribution range to support its conservation management. Genetic diversity and population structure of 529 individuals sampled from 14 populations were investigated using seven nuclear simple sequence repeat (nSSR) markers and three chloroplast DNA (cpDNA) fragments. Parentage analysis was used to evaluate the pollen and seed dispersal distances. The nSSR marker analysis revealed a high genetic diversity in M. kwangsiensis, with an average observed (Ho) and expected heterozygosities (He) of 0.726 and 0.687, respectively. The mean and maximum pollen and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
FIGURE 1
FIGURE 2
FIGURE 3
FIGURE 4
FIGURE 5
FIGURE 6
FIGURE 7
FIGURE 8| Locality | Population code | Longitude | Latitude | Elevation (m) | Sample size SSR (cpDNA) | |
|---|---|---|---|---|---|---|
| Eastern | Dongkuai, Huanjiang, Guangxi | DK | 107.98306° | 25.13750° | 637 | 119 (10) |
| Dongzai, Huanjiang, Guangxi | DZ | 107.98222° | 25.13306° | 696 | 106 (10) | |
| Changdong, Huanjiang, Guangxi | CD | 107.97806° | 25.15444° | 650 | 37 (10) | |
| Bannan, Huanjiang, Guangxi | BN | 107.97028° | 25.05361° | 535 | 35 (10) | |
| Hongdong, Huanjiang, Guangxi | HD | 107.97944° | 25.11583° | 700 | 32 (10) | |
| Kudong, Huanjiang, Guangxi | KD | 107.97167° | 25.12694° | 675 | 38 (10) | |
| Xiabaishan, Huanjiang, Guangxi | XBS | 107.93750° | 25.14528° | 825 | 37 (10) | |
| Houmeishan, Huanjiang, Guangxi | HMS | 107.92278° | 25.14167° | 701 | 36 (10) | |
| Jiabadong, Huanjiang, Guangxi | JBD | 108.00833° | 25.18444° | 650 | 16 (10) | |
| Luocheng, Guangxi | LC | 108.81861° | 24.86556° | 450 | 31 (10) | |
| Pailang, Maolan, Guizhou | PL | 107.93556° | 25.25806° | 750 | 24 (10) | |
| Ladongning, Maolan, Guizhou | LDN | 107.89556° | 25.17028° | 750 | 35 (10) | |
| Donglou, Maolan, Guizhou | DL | 107.91306° | 25.30056° | 700 | 20 (10) | |
| Western | Gulinjin, Maguan, Yunnan | GLQ | 103.91139° | 22.70861° | 847 | 26 (10) |
| Total | – | – | – | – | – | 592 (140) |
| Pop. | Haplotypes (number) | SSR | Wilcoxon test | Mode‐shift | |||
|---|---|---|---|---|---|---|---|
| Na | Ne | Ho | He | ||||
| DK | H1 (10) | 8.571 | 3.677 | 0.748 | 0.701 | 0.766 | L‐shaped |
| DZ | H1 (10) | 8.571 | 4.205 | 0.805 | 0.752 | 0.148 | L‐shaped |
| CD | H1 (10) | 7.286 | 3.968 | 0.788 | 0.729 | 0.148 | L‐shaped |
| BN | H1 (10) | 5.143 | 2.734 | 0.522 | 0.568 | 0.406 | L‐shaped |
| HD | H1 (10) | 6.714 | 3.807 | 0.817 | 0.723 | 0.055 | L‐shaped |
| KD | H1 (10) | 6.143 | 3.281 | 0.718 | 0.689 | 0.148 | L‐shaped |
| XBS | H1 (10) | 6.429 | 3.907 | 0.676 | 0.684 | 0.406 | L‐shaped |
| HMS | H1 (10) | 6.857 | 3.631 | 0.675 | 0.699 | 0.234 | L‐shaped |
| JBD | H1 (10) | 5.714 | 3.536 | 0.821 | 0.696 | 0.027 | L‐shaped |
| LC | H1 (10) | 6.143 | 3.185 | 0.664 | 0.649 | 0.148 | L‐shaped |
| PL | H3 (10) | 6.429 | 3.679 | 0.756 | 0.700 | 0.289 | L‐shaped |
| LDN | H1 (10) | 7.429 | 3.052 | 0.665 | 0.662 | 0.973 | L‐shaped |
| DL | H1 (10) | 5.429 | 3.597 | 0.779 | 0.697 | 0.148 | L‐shaped |
| GLQ | H2 (10) | 6.429 | 3.328 | 0.731 | 0.667 | 0.188 | L‐shaped |
| Mean | – | 6.663 | 3.542 | 0.726 | 0.687 | – | – |
| DK | DZ | CD | BN | HD | KD | XBS | HMS | JBD | LC | PL | LDN | DL | GLQ | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DK | – | 0.043 | 0.040 | 0.052 | 0.059 | 0.051 | 0.052 | 0.022 | 0.065 | 0.058 | 0.078 | 0.032 | 0.045 | 0.126 |
| DZ | 5.603 | – | 0.029 | 0.064 | 0.046 | 0.048 | 0.060 | 0.039 | 0.059 | 0.045 | 0.055 | 0.060 | 0.043 | 0.086 |
| CD | 5.971 | 8.492 | – | 0.055 | 0.045 | 0.070 | 0.063 | 0.032 | 0.047 | 0.065 | 0.049 | 0.055 | 0.040 | 0.112 |
| BN | 4.533 | 3.677 | 4.258 | – | 0.092 | 0.085 | 0.087 | 0.046 | 0.087 | 0.055 | 0.102 | 0.068 | 0.089 | 0.176 |
| HD | 4.001 | 5.241 | 5.357 | 2.468 | – | 0.088 | 0.066 | 0.060 | 0.079 | 0.090 | 0.068 | 0.067 | 0.062 | 0.117 |
| KD | 4.661 | 4.932 | 3.315 | 2.675 | 2.579 | – | 0.070 | 0.053 | 0.072 | 0.077 | 0.075 | 0.075 | 0.075 | 0.121 |
| XBS | 4.577 | 3.914 | 3.703 | 2.631 | 3.51 | 3.336 | – | 0.045 | 0.090 | 0.057 | 0.051 | 0.076 | 0.043 | 0.120 |
| HMS | 11.23 | 6.116 | 7.508 | 5.208 | 3.888 | 4.477 | 5.362 | – | 0.061 | 0.046 | 0.078 | 0.030 | 0.041 | 0.118 |
| JBD | 3.6 | 4.006 | 5.048 | 2.612 | 2.915 | 3.203 | 2.525 | 3.853 | – | 0.090 | 0.082 | 0.095 | 0.077 | 0.125 |
| LC | 4.065 | 5.25 | 3.589 | 4.287 | 2.516 | 3.014 | 4.137 | 5.188 | 2.542 | – | 0.080 | 0.075 | 0.076 | 0.117 |
| PL | 2.969 | 4.256 | 4.815 | 2.198 | 3.401 | 3.062 | 4.685 | 2.968 | 2.787 | 2.892 | – | 0.101 | 0.060 | 0.128 |
| LDN | 7.508 | 3.941 | 4.257 | 3.411 | 3.491 | 3.097 | 3.023 | 8.072 | 2.384 | 3.103 | 2.221 | – | 0.054 | 0.136 |
| DL | 5.366 | 5.551 | 6.057 | 2.558 | 3.774 | 3.103 | 5.532 | 5.8 | 2.996 | 3.036 | 3.896 | 4.386 | – | 0.110 |
| GLQ | 1.73 | 2.655 | 1.99 | 1.171 | 1.888 | 1.816 | 1.84 | 1.871 | 1.745 | 1.881 | 1.709 | 1.585 | 2.016 | – |
| Source of variation | d.f. | Sum of squares | Variance components | Percentage variation (%) |
|---|---|---|---|---|
| Among groups | 1 | 51.7 | 0.353 Va | 11.46 |
| Among populations within groups | 12 | 285.4 | 0.255 Vb | 8.29 |
| Within populations | 1170 | 2886.9 | 2.467 Vc | 80.24 |
| Total | 591 | 3224.0 | 3.075 |
| Species | Na | Ho | He |
| Method | References |
|---|---|---|---|---|---|---|
|
| 12.6 | 0.726 | 0.687 | 0.125 | SSR | This paper |
|
| 22.1 | 0.666 | 0.719 | 0.185 | SSR | Tamaki et al. ( |
|
| 1–7.2 | 0–0.6 | 0–0.664 | 0.636 | SSR | Kikuchi and Osone ( |
|
| 2.8–5.6 | 0.375–0.586 | 0.442–0.622 | 0.140 | SSR | Kikuchi and Osone ( |
|
| – | 0.349 | 0.307 | 0.340 | SSR | Fan et al. ( |
|
| 27.8 | – | 0.782 | 0.133 | SSR | Tamaki et al. ( |
|
| – | 0.21 | 0.52 | 0.236 | EST‐SSR | Wang et al. ( |
|
| 8.2 | 0.457 | – | 0.171 | SSR | Sun et al. ( |
|
| 4.091 | 0.412 | 0.505 | 0.090 | SSR | Zhao et al. ( |
|
| 14.375 | 0.686 | 0.718 | 0.139 | SSR | Deng et al. ( |
|
| 4.329 | 0.554 | 0.537 | 0.058 | SSR | Zhang et al. ( |
|
| 15.571 | 0.445 | 0.536 | 0.185 | SSR | Xiao et al. ( |
|
| 2.604 | 0.451 | 0.423 | 0.425 | SSR | Xiong et al. ( |
|
| 15.417 | 0.197 | 0.6 | 0.327 | SSR | Yang et al. ( |
|
| 2.707 | 0.262 | 0.399 | 0.302 | SSR | Zhou et al. ( |
- —Guangxi Natural Science Foundation 10.13039/100012547
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic diversity and population structure · Plant and Fungal Species Descriptions · Natural product bioactivities and synthesis
INTRODUCTION
1
The Karst Area in Southern China (KASC) harbors important reservoirs or “arks” of tropical and subtropical biodiversity, characterized by high levels of species diversity, endemism, and endangered plants (Clements et al., 2006; Yang et al., 2021). In a typical karst ecosystem, soil nutrients and moisture exhibit heterogeneity along topographic gradients, resulting in several karst microhabitats (Shen et al., 2019). The emergence of these karst ecological niches leads to the speciation and diversification of the karst flora (Clements et al., 2006). Plants that grow on limestone karst must adapt to its shallow soil, high Ca^2+^ and Mg^2+^ levels, and low water‐holding capacity, among other harsh challenges (Geekiyanage et al., 2019; Hao et al., 2015; Nie et al., 2011; Zhang et al., 2011). Extreme substrates, such as limestone, are often highly fragmented, restricting the distribution of many plant species to specialized substrates that tend to occur in spatially isolated patches, leading to small population sizes (Corlett & Tomlinson, 2020; Fan et al., 2017; Laanisto et al., 2012; Tamme et al., 2010). Moreover, human disturbance and climate change are adversely impacting this fragile ecosystem (Liu et al., 2023; Seidl et al., 2017). Thus, the endemic karst plant species are at risk of extinction due to several biotic and abiotic stressors (Duan et al., 2021; Li et al., 2013).
The fragmentation of plant populations, especially those of rare and endemic endangered species, might lead to inbreeding depression, fixation of deleterious mutations, and loss of adaptive genotypes (Aguilar‐Aguilar et al., 2023; Isagi et al., 2007; Martín‐Hernanz et al., 2019). Furthermore, long‐term fragmentation of plant populations is predicted to increase the chances of population differentiation due to genetic drift (Aguilar et al., 2008; Schultz & Scofield, 2009). The enhanced isolation of fragmented populations can affect pollen and seed dispersals, altering the patterns of gene flow and, thereby, impacting the genetic diversity among these populations. Previous studies have shown that strong geographical isolation by fragmentation can reduce pollen and seed migration and increase inbreeding (Gong et al., 2010); however, some studies have reported contrasting results (Gong & Gong, 2016; Wang et al., 2010; White et al., 2002). For instance, a previous study showed that the fragmented populations of the dioecious karst tree Eurycorymbus cavaleriei, a species visited by a broad spectrum of insect pollinators, can maintain genetic connectivity via long‐distance pollen gene flow (Wang et al., 2010). Differential patterns of pollen flow within fragmented populations are usually attributed to the composition and/or abundance of their pollinators and the foraging behavior of animals (Wang et al., 2010). Moreover, genetic connectivity among plant populations in natural landscapes is also determined by the distance and patterns of dispersed seeds (Ozawa et al., 2013). These studies suggested that different plant species exhibit varying responses to a fragmented landscape.
Magnolia kwangsiensis Figlar & Noot [Woonyoungia septentrionalis (Dandy) Y. W. Law] is a dioecious tree endemic to the limestone karst of China (Figlar & Nooteboom, 2004; Wu et al., 2008; Yang, Guo & Zailiu, 2018). This species currently occurs in two disjunct regions. One of these regions is located at the border of Guangxi and Guizhou, where populations are predominantly distributed in the Mulun National Nature Reserve and Maolan National Nature Reserve. This region comprises 72 fragmented distribution sites with a total of 5162 M. kwangsiensis individuals, with 29 sites harboring <5 individuals (Ling et al., 2021; Liu et al., 2020). The other disjunct region is in southeastern Yunnan (Yang, Guo & Zailiu, 2018). Due to its restricted geographic distribution, high habitat specificity, and slow natural regeneration (Fu et al., 2009, 2012; Ling et al., 2021), M. kwangsiensis has been listed in the Class I National Key Protected Wild Plant List of China (https://www.forestry.gov.cn/). Previously, a genetic structure analysis of five M. kwangsiensis populations, based on amplified fragment length polymorphism (AFLP), revealed significant genetic differentiations among the populations (Zhao et al., 2010). The dioecious nature of M. kwangsiensis, together with its pollination strategy (insect‐transmitted) and bird seed‐dispersal (Lai, Pan et al., 2007; Wang et al., 2019), can facilitate gene flow among the plant populations. Thus, a comprehensive understanding of the levels and patterns of genetic diversity in M. kwangsiensis across its entire distribution range is warranted.
In the present study, we analyzed the genetic diversity and population structure of M. kwangsiensis using seven nuclear simple sequence repeat (nSSR) markers and three chloroplast DNA (cpDNA) fragments. The study aimed to (i) characterize the level of genetic diversity in M. kwangsiensis, (ii) measure the pollen and seed gene flow distances and the patterns of genetic variability within M. kwangsiensis populations across a fragmented karst forest landscape, and (iii) discuss possible implications of these population genetic data for the management and conservation of M. kwangsiensis.
MATERIALS AND METHODS
2
Sample collection and DNA extraction
2.1
We collected a total of 592 samples from 14 M. kwangsiensis populations in 2011, with 20–119 individuals per population. Of these populations, nine were located in the Mulun National Nature Reserve, Huanjiang County, Guangxi Province; three in the Maolan National Nature Reserve, Lipo County, Guizhou Province; one in Luocheng County, Guangxi Province; and one in the Gulinjing Provincial Nature Reserve, Maguan County, Yunnan Province. The populations in the Guangxi and Guizhou Provinces were defined as the eastern populations, and the population in Yunnan was defined as the western population. The details about these regions and their M. kwangsiensis populations are depicted in Table 1 and Figure 1a–c.
Geographical distribution of the Magnolia kwangsiensis populations (a, b). Photographs of the habitat and male flower (inset) of M. kwangsiensis (c). The network of M. kwangsiensis haplotypes based on chloroplast DNA (cpDNA) (d). The gray‐shaded areas on the map are karst landforms. The remaining colors represent the haplotypes. This map was made based on the standard map No. GS (2019) 1822 downloaded from the standard map service website of the Ministry of Natural Resources of the People's Republic of China. The base map is open‐access and has not been modified. The geological data of the southwestern karst region was obtained from the National Earth System Science Data Center, National Science & Technology Infrastructure of China.
The position of all M. kwangsiensis individuals were located in DK and DZ and their junctions. A total of 332 plants were identified, including 67 male‐flowering, 63 female‐flowering, 65 saplings (diameter at breast height (DBH) < 2.5 cm), and 137 non‐flowering adult individuals that year (DBH > 2.5 cm). Leaves from all individuals and 329 mature seeds from seven seed‐trees were collected and used for the parentage analysis (Figure 2). The collected seeds were soaked in laundry detergent for 6–8 h, washed to remove the aril, stored in sterile sand, and germinated in a greenhouse (Lai, Huang et al., 2007). Their germination rate was 100%. Total genomic DNA was extracted from fresh leaves (dried using silica gel) using the cetyltrimethylammonium bromide (CTAB) method previously described by Doyle and Doyle (1987).
The distribution of Magnolia kwangsiensis individuals in the DK and DZ populations used for parentage analysis.
Seven nSSR markers (Ksep01, Ksep05, Ksep07, Ksep09, M6D3, stm0310, and stm0246) were selected for this study (see Appendix 1). M6D3, stm0310, and stm0246 and the remaining four markers were developed for Magnolia obovata, Magnolia stellata, and M. kwangsiensis, respectively (Isagi et al., 1999; Lin et al., 2011; Setsuko et al., 2005). The PCR mixture for each sample comprised 2 μL of 10× PCR buffer, 0.4 μL of MgCl_2_ (25 mM), 0.4 μL of dNTPs (10 mΜ), 0.2 μL of each primer (50 μM), 0.2 μL of rTaq polymerase (all these reagents were obtained from TaKaRa, China), 1 μL of sample DNA (20–30 ng), and 15.6 μL of ddH_2_O (total volume: 20 μL). The PCR protocol was as follows: An initial denaturation step at 94°C for 10 min, followed by 30 cycles of denaturation at 94°C for 45 s, annealing at the appropriate temperature for 45 s for each primer pair, and an extension at 72°C for 45 s. A final elongation step was performed for each reaction at 72°C for 10 min. Allele and genotype frequencies for each locus were obtained from the data reads using a 6% polyacrylamide denaturing gel electrophoresis with a 10 bp DNA ladder (Invitrogen) as the reference. The bands were visualized via silver staining.
Amplifying and sequencing the intergenic regions in chloroplast DNA
2.2
We amplified and sequenced cpDNA fragments from five individuals in each population. A total of 15 primer pairs were used (see Appendix 1). Among these primers, those with polymorphic loci were used to amplify and sequence the cpDNA from five more individuals from each population. The primer pairs were derived from the large and small single‐copy (LSC and SSC) regions previously described by Shaw et al. (2005, 2007). PCR mixture for each sample comprised 5 μL of 10× PCR buffer (containing Mg^2+^), 4 μL of dNTPs (2.5 mM), 0.5 μL of each forward and reverse primers (50 μM), 1 U rTaq polymerase (all reagents from TaKaRa), 1 μL of DNA solution (20–30 ng), and 38.5 μL of ddH_2_O (total volume: 50 μL). The PCR amplifications were performed in a thermocycler using the same protocol used for nSSR amplification described in Section 2.1. The PCR products were directly purified and sequenced using a 3730XL DNA Analyzer (Applied Biosystems).
Parentage analysis
2.3
Parentage analysis was conducted using the CERVUS v3.0 software based on the exclusion and maximum likelihood methods (Corlett & Tomlinson, 2020) to estimate pollen and seed distribution rates for DK and DZ, within a 500 m × 600 m plot (Jones et al., 2010). In the exclusion method, zero mismatches were accepted. For the likelihood method, five parameters were set as follows: Minimum number of matching loci: 7, error rate: 0.01, number of candidate parents: 130 (including 63 mother and 67 father trees), the proportion of candidate parents sampled: 0.95, and proportion of loci typed: 0.99 (Garcia et al., 2005). The critical log odds ratio (LOD) scores with 95% (strict) and 80% (relaxed) levels of confidence were calculated based on 10,000 simulations. The most likely father or mother was assigned when the confidence level was >80%, and the pair loci mismatching was 0.
Magnolia kwangsiensis is a dioecious tree (Wu et al., 2008). Given that the mother tree of the harvested seeds is known, the effective pollen dispersal distance was determined by calculating the distance between the two parents when the father of the seeds was identified using parentage analysis of the plants with male flowers in the current year. Similarly, when the mother tree of the saplings was identified through parentage analysis of the plants with female flowers in the current year, the effective seed dispersal distance was determined by calculating the distance of the parent from the saplings.
SSR analysis
2.4
The Hardy–Weinberg equilibrium and linkage disequilibrium (LD) tests were conducted using the GENEPOP v4.1 software (Raymond & Rousset, 1995). The exact level test was performed using a Markov‐chain random program with the following parameters: 10,000 dememorizations, 5000 batches, and 5000 iterations per batch.
Diversity measures, such as number of alleles per locus (Na), effective number of alleles (Ne), observed heterozygosity (Ho), expected heterozygosity (He), and the genetic differentiation coefficient (F ST), were calculated using the GenAIEx v6.5 software (Peakall & Smouse, 2012). A principal coordinates analysis (PCoA) was undertaken using GenAlEx to derive the patterns of genetic differentiation based on pairwise F ST values among the populations. Gene flow (Nm), based on F ST values, was calculated using the formula Nm = [(1 − F ST)/4F ST] based on the migration‐drift equilibrium hypothesis (Wright, 1970). Next, we performed a Mantel test using GenAIEx v6.5 to assess whether genetic differentiation was primarily impacted by isolation by distance (IBD). Analysis of Molecular Variance (AMOVA) was conducted using the ARLEQUIN v.3.5 software to compare differences among the groups, among the samples within the same group, and among all samples (Excoffier et al., 2005). For this analysis, two regional groups, comprising the eastern (DK, DZ, CD, BN, HD, KD, XBS, HMS, JBD, LC, PL, LDN, and DL) and western (GLQ) populations, were compared. To assess the significance of the differences, 10,000 permutations were used for the AMOVA calculations.
We used STRUCTURE v2.3.4 software to analyze the population structure of M. kwangsiensis (Pritchard et al., 2000). We conducted 10 independent runs, with a burn‐in period of 10^5^ steps, followed by 10^6^ iterations, to estimate the ideal population size (K) ranging from 1 to 14. The admixture model was specified for each run. To identify the most probable K value, we used the delta K method as described previously (Evanno et al., 2005). The ideal number of population clusters was determined by the highest K value estimated by the STRUCTURE HARVESTER program (Earl & Vonholdt, 2011). The final merged results were obtained using the CLUMPP v1.1 (Jakobsson & Rosenberg, 2007) and DISTRUCT v1.1 programs (Rosenberg, 2003). A phylogenetic tree of the 14 plant populations was built using the unweighted pair group method with arithmetic mean (UPGMA), based on Nei's (1983) genetic distance, using the Populations v1.2.32 software (http://bioinformatics.org/~tryphon/populations/).
Based on the nSSR data, BARRIER v2.2 (Manni et al., 2004) was used to identify potential barriers to gene flow among the plant populations. A Voronoï tessellation map was created using Delaunay triangulation and the geographical coordinates of the 14 sample sites. Each population was contained in a polygon with edges adjacent to the neighboring sampling points. Then, Monmonier's (1973) maximum difference algorithm was used to build potential barriers among the neighboring populations through the subdivision map and a matrix of D genetic distances (Jost, 2008). The analysis was repeated on 500 resampled bootstrap D matrices in the R package “diversity” v1.9.90 (Keenan et al., 2013) to assess the robustness of the calculated barriers.
We used the BOTTLENECK software (Piry et al., 1999) to detect the possible occurrence of a bottleneck in each population. For this analysis, the Wilcoxon signed rank test and the two‐phase model (TPM) were used. The default settings for the TPM included 30% and 70% variations from the infinite alleles model (Newton et al., 2008) and the strict stepwise mutation model (SMM), respectively.
cpDNA sequence analysis
2.5
All cpDNA sequences were assembled using SeqMan (Clewley, 1995) and aligned using the CLUSTAL X tool (Thompson et al., 1997). We obtained three polymorphic fragments from the 15 primer pairs. These fragments were further amplified and used for the haplotype analysis. The number of haplotypes (h) and polymorphic sites was calculated using the DNAsp v5.10 software (Librado & Rozas, 2009). The network was constructed using the median‐joining method (Bandelt et al., 1999) in the Network v5.0 software (Xu et al., 2020) to study the genealogical relationships among all the haplotypes. The geographical distribution of the populations and haplotypes was then mapped using ArcMap GIS v10.5 (ESRI) (Ueno et al., 2005). All the generated cpDNA sequences have been deposited in GenBank under accession numbers OP909724–OP909732, OQ848601–OQ848612 (Sayers et al., 2022).
A phylogenetic tree of the cpDNA haplotypes was constructed using the ML method in IQ‐TREE v1.6.12 (Nguyen et al., 2015). The chloroplast genome sequences of Magnolia sinica (NC 023241) and Magnolia lotungensis (NC 062929), downloaded from the GenBank public sequence database, were used as outgroups after removing redundant fragments (Sayers et al., 2022). The TIM + F + I model was chosen as the most appropriate substitution model according to the lowest Akaike Information Criterion (AIC) described in IQ‐TREE ModelFinder (Kalyaanamoorthy et al., 2017). The accuracy of the branching models was determined using 1000 bootstrap replications.
Divergence times for the three polymorphic fragments from the 15 primer pairs were estimated using the BEAST v1.6.1 software under a strict clock model (Drummond & Rambaut, 2007). Evolutionary rates of 1 × 10^−9^ substitutions/site/year and 3 × 10^−9^ substitutions/site/year were used based on the evolutionary rate of angiosperm cpDNA (Drummond & Rambaut, 2007). We used the general time‐reversible (GTR) substitution model with a gamma distribution and four rate categories. The performance of different Bayesian tree priors was determined according to Yule (yule.birthRate = uniform prior [0.0, ∞]). Two independent analyses (with different evolutionary rates) were run in BEAST for 100,000,000 generations of Markov chain Monte Carlo (MCMC). Finally, the Tracer 1.7 program was used to evaluate the convergence of the chain (Rambaut et al., 2018). The top 10% of Bayesian trees were filtered using the TreeAnnotator tool (in BEAST v1.6.1).
RESULTS
3
Genetic diversity based on nSSR markers
3.1
Diversity estimates for each locus were generalized (see Appendix 2). The seven selected microsatellite loci yielded a total of 88 alleles (~12.6 per locus), with Ksep07 and stm0246 yielding the fewest (n = 6) and the most alleles (n = 23), respectively. The Hardy–Weinberg equilibrium analysis showed that only five of 98 locus‐population combinations deviated from the equilibrium after the Markov‐chain random correction (p < .01). LD testing indicated significant linkages in 41 out of 294 locus pairs; however, the linked locus pairs were found in only four populations. These findings suggested the seven microsatellite markers were relatively independent in their generational transmission. Hence, they could provide a more realistic reflection of the genetic characteristics of M. kwangsiensis.
As shown in Table 2, M. kwangsiensis exhibited a high level of genetic diversity (Na = 6.663, Ne = 3.542, Ho = 0.726, and He = 0.687). Among the 14 populations, the BN and the DZ populations exhibited the lowest and highest levels of genetic diversities, respectively (Na = 5.143 and 8.571, Ne = 2.734 and 3.677, Ho = 0.522 and 0.748, and He = 0.568 and 0.701, respectively).
Genetic structure deduction based on nSSR data
3.2
The F ST and Nm values are shown in Table 3. The pairwise F ST values ranged from 0.022 (DK and HMS) to 0.176 (GLQ and BN). However, the pairwise F ST values between GLQ and eastern populations ranged from 0.086 (DZ) to 0.176 (BN). This range of values exceeded the F ST values within the eastern populations (except for BN and PL). These results revealed a high genetic differentiation between GLQ and the eastern populations. The pairwise Nm values ranged from 2.221 (PL and LDN) to 11.230 (DK and HMS), indicating a high degree of gene flow in the eastern populations. AMOVA demonstrated that 11.46%, 8.29%, and 80.24% of the genetic variation could be attributed to intergroup differences, differences among the populations within the same group, and differences among all the populations, respectively (Table 4).
TABLE 3: Pairwise Magnolia kwangsiensis populations matrix of the genetic differentiation coefficient (F ST) (above the diagonal) and gene flow (Nm) (below the diagonal).
The BARRIER analysis (Figure 3) revealed a strong gene flow barrier between the western and eastern populations with >99% support. Only JBD in the eastern population exhibited gene flow barriers with its adjacent populations (PL, CD, DK, DZ, HD, and DL), with statistical support ranging from 67% to 72%.
Delaunay triangulation network and Voronoï tessellation of the barrier analyses for Magnolia kwangsiensis and its close relatives. Black lines show Voronoï polygons, and red lines show inferred barriers. The thickness of the barriers (red lines) indicates their rank of importance, and the percentages indicate the degree of support. Only barriers with >60% support are shown.
The Mantel test results (Figure 4a,b) corroborated the significant effect of IBD in all populations (R ^2^ = .587, p < .001). However, the genetic and geographical distances of eastern populations only mildly correlated (R ^2^ = .017, p > .05).
Correlation between genetic distance F ST/(1 − F ST) and geographic distance (GD) among 13 (a, excluding GLQ) and 14 (b, all of them) populations of Magnolia kwangsiensis.
Analysis with STRUCTURE HARVESTER revealed that the number of delta K (ΔK) had clear peaks at 2 and 9 (Figure 5a,b). Nonetheless, based on the method previously described by Evanno et al. (2005), ΔK should be used together with ln probability of data (lnPD) to discern the optimal number of genetic populations. Thus, in the present study, the optimal number was 9. Furthermore, the Bayesian model suggested a clear separation between the western and eastern populations, with a mixed origin for the eastern populations (Figure 5c). Similar results were observed via the PcoA based on F ST values and the UPGMA tree based on Nei's genetic distance (Figure 6 and Appendix 4). These results indicated that the eastern populations did not exhibit a clear geographic genealogical structure, and the western and eastern populations exhibited a high genetic differentiation.
Scatterplot of the mean of the estimated ln probability of data (lnPD) versus the number of simulated clusters K (a), the relationship between K and Delta K (b), and the population structures with K = 2 and K = 9 (c) based on nuclear simple sequence repeat.
Principal coordinate analysis (PCoA) of all specimens from the 14 Magnolia kwangsiensis populations.
Bottleneck analyses showed that, based on TPM conditions, the allele distribution of each population exhibited a normal L‐shaped distribution, which was consistent with the mutation‐drift equilibrium. The Wilcoxon test revealed a significant heterozygosity in JBD (p < .05), indicating its deviation from the mutation‐drift equilibrium. However, none of the other 13 populations exhibited a recent genetic bottleneck (Table 2).
Parentage analysis
3.3
Based on the offspring‐mother genotype comparison, parentage analysis of the seven microsatellite loci yielded a total of 63 alleles (~9 alleles per locus). The fewest (n = 4) and the most (n = 15) alleles were detected for Ksep07 and stm0246, respectively. The abundance of genotypes enabled a thorough genetic analysis of the relationship between conspecific individuals. The analysis of genotyping errors revealed that the error rates between seeds and mother trees ranged from 0 (at locus Ksep07) to 0.035 (at locus stm0246), with all loci less than 5% (i.e., 0.05). Based on these results, seven corresponding primer pairs were selected for parentage analysis.
Further parentage analysis helped identify the male parent of 132 seeds (40.12%) and the female parent of 34 saplings (52.3%; refer to Figure 7a,b and Appendix 3 for details). The pollen dispersal distance ranged from 2.2 to 535.4 m (average distance: 66.4 m). For seven out of 132 seeds (5.3%), pollen flow was identified between DK and DZ. The seed dispersal distance ranged from 0.2 to 553.8 m, averaging 95.7 m. For three out of 34 saplings (8.8%), seed dispersal was identified between DK and DZ. However, the female parents of the remaining specimens [n = 31/65 (47.7%)] could not be identified. It is possible that these female parents were present outside the study area or were among the 132 non‐flowering individuals.
Distance versus frequency histograms of pollen (a) and seed (b) dispersal distance for Magnolia kwangsiensis.
Haplotypes based on cpDNA sequences
3.4
After gene tandem and alignment, the fragments amplified using the 15 primer pairs (see Appendix 1) had a total length of 14,703 bp. Only three primer pairs with polymorphisms—rpoB‐trnC, ndhF‐rpl32, and rpl32‐trnL—were selected for the haplotype analysis. After sequence splicing and artificial correction, we obtained 3663 bp long cpDNA sequences from 140 individuals. Three polymorphic loci were detected, and three chloroplast haplotypes were uncovered (H1–H3, see Appendix 5). The different haplotypes and their network and geographical distribution are presented in Table 2 and Figure 1a,b,d. Evidently, haplotype H1 was the most abundant and widely distributed. It was found in 12 populations: DK, DZ, CD, BN, HD, KD, XBS, HMS, JBD, LC, LDN, and DL. In stark contrast, haplotypes H2 and H3 were unique to the GLQ and PL populations, respectively (Table 2). In the haplotype network, H2 was separated from H3 and H1 by three and four mutational steps, respectively (Figure 1d).
Molecular dating by cpDNA sequences
3.5
The phylogenetic tree built using the ML method divided the three haplotypes into two genetic lineages (Figure 8). Further analysis revealed that the divergence time between the western (H2) and eastern populations spanned 211.5 ka (1 × 10^−9^ substitutions/site/year) to 69.9 ka (3 × 10^−9^ substitutions/site/year) before the present (BP) (Figure 8).
Maximum likelihood phylogenetic tree of the three chloroplast haplotypes (H1–H3) detected in Magnolia kwangsiensis using M. sinica and M. lotungensis as the outgroups. Each internal node is labeled with the posterior probabilities with rates of 1 × 10−9 substitutions/site/year and 3 × 10−9 substitutions/site/year. The scale represents thousands of years ago (ka).
DISCUSSION
4
Genetic differentiation between two disjunct regions
4.1
The chloroplast haplotype tree, structure analysis, PCoA, and UPGMA tree analysis indicated that the 14 populations split into two differentiated groups, which was consistent with their disjunct geographical distribution (Figures 5c and 6 and Appendix 4). BARRIER analysis showed a strong gene flow barrier between the two differentiation groups (Figure 3). Additionally, AMOVA suggested that intergroup variation was higher (11.46%) than the variation among populations within the same group (8.29%) (Table 4). The divergence time between western (GLQ) and eastern populations was estimated to be ca. 221.5–69.9 ka (Figure 8), which seemed to coincide with the Last Glacial Period (LGP) (Yi et al., 2005). Therefore, we speculated that M. kwangsiensis persisted in both these regions during the glacial periods, resulting in the current disjunct distributions of its populations. Limestone mountains have been widely linked to refugia (Bátori et al., 2019). The refugia in the KASC evolved to support Cycas (Tao et al., 2021), Primulina (Ke et al., 2021), and Heteroplexis (Zhu et al., 2022). However, the complex topography of the karst landscape can buffer the extreme climatic conditions, allowing for the survival of existing species and the emergence of new lineages (Hewitt, 2000; Tzedakis et al., 2002).
Strong gene flow among the eastern populations
4.2
The F ST values indicated a high level of gene flow among the eastern populations. The gene flow of M. kwangsiensis is achieved through pollen and seed transmissions. Among the overall pollen flow among the selected population, ~5.3% occurred between DK and DZ (>400 m). Furthermore, the longest pollen dispersal distance was 535.4 m. These findings indicated that long‐distance pollen dispersal could occur among the M. kwangsiensis populations. The pollen dispersal distances detected in the present study were consistent with those observed for the same genus M. stellata (Setsuko et al., 2007) and M. obovata (Isagi et al., 2004), with a corresponding maximum dispersal distance of 420 m and 500, respectively. Thrips are the most frequent and effective pollinators of M. kwangsiensis (Lai, Pan et al., 2007). Although thrips cannot fly over long distances (Denmark et al., 1996), they can spread farther with wind support (Lewis, 1991). Hence, thrip pollination can be viewed as an intermediate strategy between wind pollination and insect pollination (Chan & Appanah, 1980). These findings indicated that thrips might be dispersed by the wind over long distances while carrying M. kwangsiensis pollen on their bodies.
Pollen dispersal can contribute to gene exchange between close populations, but gene exchange among M. kwangsiensis populations over long distances is more likely mediated via seed dispersal. In the present study, we observed an average seed dispersal distance of 95.7 m, with the maximum dispersal distance detected between DK and DZ (553.8 m). Additionally, 8% of the overall seed dispersal occurred between DK and DZ (>400 m). Moreover, we could not trace the female parents of 47.7% of the dispersed seeds, indicating that these seeds might have dispersed from outside populations. M. kwangsiensis seeds are known to be dispersed by 14 frugivorous bird species, such as the chestnut bulbul (Hemixos castanonotus), striated yuhina (Yuhina castaniceps), and scarlet minivet (Pericrocotus flammeus) (Wang et al., 2019). In addition, frugivorous birds can facilitate long‐distance seed dispersal (Isagi et al., 2004; Mueller et al., 2014; Viana et al., 2016). Seed dispersal is the only way populations can exchange individuals or colonize new habitats. This ecological process is critical for the survival of plant species in patchy landscapes (Browne & Karubian, 2018). There are 72 M. kwangsiensis fragmented distribution sites in the eastern distribution region (Ling et al., 2021; Liu et al., 2020). Although most sites harbor only a few individuals, they can serve as stepstones for gene exchange. Long‐distance pollen and seed dispersal, combined with these stepstones for gene exchange, help maintain genetic connectivity among the fragmented M. kwangsiensis populations.
High genetic diversity in M. Kwangsiensis
4.3
Contrary to the general expectation from a spatially isolated rare species, M. kwangsiensis does not exhibit a reduced genetic diversity at either the population or the species level unlike a related Magnoliaceae species (Table 5) (Fan et al., 2020; Kikuchi & Osone, 2021; Sun et al., 2011; Tamaki et al., 2008, 2018; Wang et al., 2018; Xiao et al., 2023; Xiong et al., 2014; Yang, Yang & Li, 2018; Zhang et al., 2017; Zhao et al., 2012; Zhou et al., 2021; Deng et al., 2020). Obligate outcrossing plant species tend to be more genetically diverse (Hamrick & Godt, 1996; Jia et al., 2015; Muyle et al., 2020; Wang et al., 2010; Zong et al., 2015). Given that dioecious species are rare in the Magnoliaceae, our findings might be attributed to the dioecy of M. kwangsiensis, which might help its populations to avoid inbreeding and facilitate the involvement of many paternal individuals in the mating process. Moreover, our analysis did not reveal any recent bottleneck in M. kwangsiensis. Effective gene exchange via pollen and seed dispersal between isolated population patches helps to retain the genetic diversity of M. kwangsiensis.
Conservation implications
4.4
Paternity analysis demonstrated that isolated patchy trees exhibit the potential for long‐distance pollen dispersal, a phenomenon also observed in other dioecious tree species in the KASC (Wang et al., 2014). Our results showed that pollen and seed dispersal can and does occur among M. kwangsiensis populations over large distances in highly fragmented karst forest landscapes. These phenomena further benefit from substantial intra‐ and inter‐patch pollinator movement and bird‐mediated dispersal because they enhance genetic connectivity across the fragmented landscape, constituting a functional network of gene flow and exchange. Nevertheless, two genetic groups of M. kwangsiensis, consistent with their disjunct distribution, were identified. Therefore, ensuring sufficient local tree densities and that inter‐fragment distances are maintained within a certain range could potentially reduce the negative genetic consequences of the fragmented karst habitat, even in species with the potential for long‐distance pollen‐ and seed‐mediated gene flow. Considering the current M. kwangsiensis population structure, its disjunct population groups should be treated as separate management units. Notably, LC lay outside the nature reserve, and JBD exhibited gene flow barriers against other populations. Therefore, LC and JBD deserve particular conservation attention.
CONCLUSIONS
5
Both pollen flow and seed dispersal helped maintain genetic connectivity among isolated M. kwangsiensis populations, resulting in its high genetic diversity across the karst landscape. The two genetically differentiated groups analyzed in this study were consistent with the two disjunct regions of the species. Our results provided a robust genetic basis for the conservation of M. kwangsiensis.
AUTHOR CONTRIBUTIONS
Shaoqing Tang: Conceptualization (equal); funding acquisition (lead); project administration (lead); supervision (lead); validation (lead); visualization (lead); writing – review and editing (equal). Zhiyong Zhang: Conceptualization (equal); writing – review and editing (supporting). Sujian Wei: Conceptualization (equal); supervision (supporting); writing – review and editing (equal). Yingying Xiang: Conceptualization (equal); data curation (supporting); formal analysis (supporting); validation (supporting); visualization (supporting); writing – original draft (lead); writing – review and editing (supporting). Yanfang Lin: Conceptualization (equal); data curation (lead); formal analysis (lead); investigation (equal); methodology (lead); software (lead). Yanhua Liu: Data curation (supporting); investigation (equal). Qiwei Zhang: Data curation (supporting); formal analysis (supporting); investigation (equal); methodology (supporting); software (supporting).
FUNDING INFORMATION
This study was supported by the Guangxi Natural Science Foundation (2010GXNSFA013069) and the Survey and Assessment of priority areas for terrestrial biodiversity conservation in Guangxi (2022–2023).
CONFLICT OF INTEREST STATEMENT
The authors declare no conflict of interest.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aguilar, R. , Quesada, M. , Ashworth, L. , Herrerias‐Diego, Y. , & Lobo, J. (2008). Genetic consequences of habitat fragmentation in plant populations: Susceptible signals in plant traits and methodological approaches. Molecular Ecology, 17, 5177–5188.19120995 10.1111/j.1365-294X.2008.03971.x · doi ↗ · pubmed ↗
- 2Aguilar‐Aguilar, M. D. , Cristobal‐Pérez, E. J. , Lobo, J. , Fuchs, E. J. , Oyama, K. , Martén‐Rodríguez, S. , Herrerías‐Diego, Y. , & Quesada, M. (2023). Gone with the wind: Negative genetic and progeny fitness consequences of habitat fragmentation in the wind pollinated dioecious tree. American Journal of Botany, 110, e 16157.36934453 10.1002/ajb 2.16157 · doi ↗ · pubmed ↗
- 3Bandelt, H.‐J. , Forster, P. , & Röhl, A. (1999). Median‐joining networks for inferring intraspecific phylogenies. Molecular Biology and Evolution, 16, 37–48.10331250 10.1093/oxfordjournals.molbev.a 026036 · doi ↗ · pubmed ↗
- 4Bátori, Z. , Vojtkó, A. , Maák, I. E. , Lőrinczi, G. , Farkas, T. , Kántor, N. , Tanács, E. , Kiss, P. J. , Juhász, O. , Módra, G. , Tölgyesi, C. , Erdős, L. , Aguilon, D. J. , & Keppel, G. (2019). Karst dolines provide diverse microhabitats for different functional groups in multiple phyla. Scientific Reports, 9, 7176.31073136 10.1038/s 41598-019-43603-x PMC 6509348 · doi ↗ · pubmed ↗
- 5Browne, L. , & Karubian, J. (2018). Habitat loss and fragmentation reduce effective gene flow by disrupting seed dispersal in a neotropical palm. Molecular Ecology, 27, 3055–3069.29900620 10.1111/mec.14765 · doi ↗ · pubmed ↗
- 6Chan, H. T. , & Appanah, S. (1980). Reproductive biology of some Malaysian dipterocarps. I: Flowering biology. The Malaysian Forester, 43, 132–143.
- 7Clements, R. , Sodhi, N. S. , Schilthuizen, M. , & Ng, P. K. L. (2006). Limestone karsts of Southeast Asia: Imperiled arks of biodiversity. Bioscience, 56, 733–742.
- 8Clewley, J. P. (1995). Macintosh sequence analysis software. DNA Star's Laser Gene. Molecular Biotechnology, 3, 221–224.7552691 10.1007/BF 02789332 · doi ↗ · pubmed ↗
