Yam genetic diversity in Sub-Saharan Africa: implications for conservation and breeding
Amal Messadia, Paterne Agre, Kwabena Darkwa, Emmanuel Amponsah Adjei, Konan Evrard Brice Dibi, Lassana Bakayoko, Désiré N’Da Pokou, Alexandre Dansi, Amani Kouakou Michel, Idris Adejumobi, Adeyinka Adewumi, Jude Obidiegwu, Hapson Mushoriwa, Balogun Morufat

TL;DR
This study explores the genetic diversity of yam species in Sub-Saharan Africa to guide conservation and breeding efforts.
Contribution
The study provides new insights into the genetic relationships and diversity of yam species in Sub-Saharan Africa.
Findings
Substantial inter- and intra-specific genetic variation was observed in African yam germplasm.
Clustering and phylogenetic analysis revealed evolutionary relationships among key yam species.
Moderate genetic differentiation was found among countries and higher differentiation among species.
Abstract
Yam (Dioscorea spp.) encompasses diverse species, including several staple food crops, of which a few were domesticated on the African continent. This study assessed yam genetic diversity in Sub-Saharan Africa (SSA) to inform breeding and conservation initiatives. A diverse collection of 1,247 yam accessions representing six species (D. rotundata, D. alata, D. praehensilis, D. bulbifera, D. cayenensis, and D. dumetorum) sourced from six African countries (Benin, Côte d’Ivoire, Democratic Republic of Congo, Ghana, Nigeria, and Uganda) was used in this study. Genetic diversity was assessed using 7,648 single-nucleotide polymorphism (SNP) markers, selected from previously sequenced datasets between the Consultative Group on International Agricultural Research (CGIAR) and National Agricultural Research and Extension Systems (NARES) collaboration. Findings showed a substantial inter- and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —https://doi.org/10.13039/100000865Bill and Melinda Gates Foundation
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPotato Plant Research · Cassava research and cyanide · Agricultural pest management studies
Introduction
Yam (Dioscorea spp.) is a key food security crop in sub-Saharan Africa (SSA), particularly in West and central Africa, where it provides a reliable source of calories, income, and cultural identity for millions of rural households [1, 2]. In West Africa alone, over 300 million people rely on yam as a staple food [3]. In addition to its high carbohydrate content, yam contributes essential micronutrients and is integral to local ceremonies, festivals, and economies [4–6]. The region produces more than 90% of global yam output, making it the epicenter of yam cultivation [7, 8].
The genus Dioscorea comprises approximately 600 species, of which only 11 are cultivated for food or medicinal use [9–11]. Among these, Dioscorea rotundata (white yam) and D. alata (water yam) are the most widely grown, accounting for about 90% of global production. D. rotundata is indigenous to African, likely domesticated in West Africa’s Niger River basin from natural hybridization between the savannah species D. abyssinica and the rainforest species D. praehensilis, as evidenced by recent genomic studies [12, 13]. The yam domestication is still an ongoing process in Africa and offers valuable insights into how farmers harness genetic resources to cultivate crops that meet their agricultural needs [14]. This long domestication history underscores the region’s yam genetic diversity and its agronomic significance in the food system.
Genetic diversity in yam has been explored using both morphological and molecular tools. Studies have reported high diversity in Chinese yam (D. opposita) [15, 16] and regional differentiation in African species [17, 18]. For example, D. rotundata accessions in Benin show low to moderate diversity, shaped by seed exchange and clonal propagation [19]. Ethiopian yams have been shown to be genetically distinct from West African genotypes [20], suggesting independent evolutionary histories. Furthermore, local adaptation has driven within-country diversity in traits like flowering, tuber morphology, and yield [18].
Advances in molecular markers have enhanced the resolution of diversity studies. Simple Sequence Repeats (SSRs) and Amplified Fragment Length Polymorphism (AFLP) markers initially revealed high polymorphism [21, 22], while newer tools such as DArTseq and genotyping-by-sequencing (GBS) have facilitated genome-wide analysis. DArTseq has identified over 10,000 SNPs in D. rotundata, enabling structure and association mapping [10, 18, 23]. A 600 K SNP panel has also been used to reconstruct evolutionary relationships between wild and cultivated yams, revealing selection sweeps associated with domestication traits [12].
Despite these advances, major knowledge gaps persist in yam genetic research including geographical biases in research focus, limitations in the types of germplasm utilized, constraints in the application of molecular markers, and various bottlenecks in breeding programs [24–27].
Recent studies show a shift from reliance on genebank materials to farmer-managed landraces, emphasizing on-farm diversity and indigenous knowledge. For instance [28], collected and characterized yam cultivars from Ekiti State, Nigeria, while [29] analyzed chromosomal variation among cultivated species from southwest Nigeria. Similar efforts in Ethiopia documented folk taxonomy and morphological diversity of landraces [30, 31]. These studies highlight the value of locally adapted germplasm but remain geographically limited and lack genomic integration across regions. Thus, despite these advances, significant knowledge gaps persist in understanding the continent-wide genetic diversity and evolutionary relationships among Dioscorea spp. In countries like Kenya, the Democratic Republic of the Congo (DRC), yam remains poorly characterized or ignored entirely in genomic initiatives [32, 33]. These gaps hinder the identification of valuable alleles for stress tolerance, disease resistance, and nutritional traits. The continued use of low-resolution markers in some regions further limits the discovery of fine-scale genomic variation [22, 32].
Breeding programs are further challenged by the yam’s complex biology, which includes erratic flowering, poor seed set, dioecy, and heterozygosity [34]. Without a comprehensive understanding of available genetic variation across the continent, breeders risk narrowing the genetic base and missing opportunities to improve the crop’s resilience and productivity [35].
To address these gaps, this study assessed the genetic diversity and population structure of 1,247 yam accessions representing six Dioscorea species from six African countries: Uganda, Ghana, Nigeria, Benin, Côte d’Ivoire, and the Democratic Republic of Congo. Specifically, we aimed to evaluate the extent of genetic variation within and among yam species, assess the population structure and gene flow across geographic regions using high-density SNP markers.
Materials and methods
Plant materials
A total of 1,247 yam genotypes from various published studies were selected from six countries and used in this study. Diversity Arrays Technology (DArT) sequencing was applied to generate six datasets used in this study. The first dataset comprised 207 D. rotundata accessions from Uganda [18]. In addition, 188 D. alata samples obtained from Côte d’Ivoire formed the second dataset [36]. The third dataset consisted of 227 D. praehensilis accessions collected from Ghana [37]. The fourth dataset was sourced from Benin that composed of 85 D. rotundata accessions [19]. The fifth dataset is included 194 accessions of both D. rotundata and D. alata from Nigeria [10, 35], and another 173 D. rotundata accessions generated through whole-genome sequencing [24]. The sixth dataset, composed of 182 accessions from DRC [38] (Fig. 1). All the data used in this study can be downloaded from the following link (https://figshare.com/account/items/29314031/edit). During the original germplasm collection, local names and origins of the yams were documented. Wild-related yam species collected from Ghana and the Democratic Republic of Congo were identified with the assistance of local farming communities and based on established descriptors, as reported in [39] and [38]. Farmers provided their consentbefore the collection, and the purpose of the collection was clearly explained to them.
Fig. 1. Origins of the 1247 accessions used in this study
Dataset compilation and quality control
All variant call format (VCF) files generated from the above projects were merged using a customized Perl script, retaining only the common SNP marker positions to produce a unified VCF file containing all genotypes. Quality control involved merging VCF files for every dataset into a single VCF file. Filtering was done to remove SNP markers with poor quality using PLINK 1.9 and VCF tools. Markers with high missing values > 80% as well as duplicate SNPmarkers were removed. Markers with low minor allele frequencies (< 0.05) were all removed.
Genotypic data analysis
Observed and expected heterozygosity, minor allele frequency (MAF), and polymorphic information content (PIC) were calculated with VCF tools and PLINK 1.9 [40]. CMplot package was also used for marker density and distribution on the 20 yam chromosomes [15]. SNP data were subjected to population structure analysis following the method described by [35]. By testing cluster numbers ranging from 2 to 50, the optimal number of clusters was identified through k-means analysis, employing cross-validation on the basis of the Bayesian information criterion (BIC). Each yam genotype was then assigned to its respective cluster if it had at least 70% ancestry probability. Genotypes with less than 70% ancestry were considered as admixed. The diversity pattern revealed through population structure analysis was further supported by discriminant analysis of the principal component (DAPC) via the Adegenet package [41]. Pairwise genetic dissimilarity distances based on identity-by-state (IBS) were estimated using plink1.9 version Linux command based. Using the IBS, hierarchical cluster dendrogram was generated and visualized using Phylogenetics and Evolution (APE) package in R [42]. Network analysis was conducted using NetworkX package in Python [43] with countries and yam species considered as factors. Analysis of molecular variance (AMOVA) within and among different yam species and countries was estimated using the fixation index (Fst) implemented in VCFtools.
Results
Genotypic summary statistics
A total of 7,648 SNP markers were retained after filtering, with unequal distribution across the 20 yam chromosomes (Fig. 2; Table 1). The genome-wide SNP density plot indicated that chromosome 5 had the highest concentration of SNPs, accounting for 10.66% of the total number of markers with 816 SNPs. In contrast, chromosome 13 had the lowest marker concentration, with only 1.93% of the SNPs, totaling 148 markers. The diversity indices for the SNP markers presented a mean PIC value of 0.232, ranging from 0.194 on chromosome 15 to 0.265 on chromosome 6. The MAF averaged 0.213 across all the markers, with values ranging from 0.176 (chromosome 15) to 0.247 (chromosome 9). The observed heterozygosity (Ho) ranged from 0.209 on chromosome 3 to 0.337 on chromosome 6, with an average of 0.259. The expected heterozygosity (He) varied between 0.239 (chromosome 15) and 0.328 (chromosome 6), with an average of 0.288 (Table 1; Fig. 2).
Table 1. Summary statistics of SNP markers across 20 Yam chromosomesChromosomeNo of SNPsHeHoMAFPICChr12560.2620.2390.1820.216Chr23280.2990.2620.2240.239Chr33120.2720.2090.1960.222Chr45400.2900.2540.2120.235Chr58160.2660.2360.2000.214Chr62440.3280.3370.2380.265Chr74360.2840.2620.2000.232Chr84960.2950.2680.2230.237Chr92280.3270.2920.2470.260Chr103640.3120.2820.2330.250Chr112960.2920.2800.2190.234Chr123680.3070.2750.2280.246Chr131480.3060.2630.2370.243Chr144280.2840.2580.2080.230Chr153760.2390.2190.1760.194Chr162680.3070.2480.2260.248Chr174080.2810.2550.2150.225Chr183720.2500.2340.1790.205Chr197000.2860.2660.2110.232Chr202640.2680.2370.2010.216Total7648average382.40.2880.2590.2130.232SD0.02310.02730.01960.0173Chr chromosome, Ho observed heterozygosity He expected heterozygosity, MAF minor allele frequency, PIC polymorphic information content, SD standard Deviation
Fig. 2. Distribution of SNP markers across the 20 yam chromosomes. Each color represents the marker density at a particular region of the yam chromosome
In terms of mutation types, transition SNPs (Ts) accounted for 61.08% of the total SNPs, while transversion SNPs (Tv) represented 39.1% (Fig. 3). Among these, A/G transitions were the most common (18.2%), whereas G/T transversions were the least frequent (0.7%).
Fig. 3. Rate of Transition and Transversion mutations based on bi-allelic SNP markers
The genetic diversity parameters of yam accessions across six countries revealed moderately high levels of expected heterozygosity (He), ranging from 0.436 (Côte d’Ivoire) to 0.451 (Benin), with an overall average of 0.444. Observed heterozygosity (Ho) was slightly lower across all countries, with values ranging from 0.249 to 0.264. Nigeria exhibited the highest MAF (0.246) and PIC (0.302) while Uganda showed the lowest MAF (0.229) and PIC (0.289). Overall, findings indicate a relatively balanced level of genetic diversity across countries, with Nigeria slightly standing out for its higher diversity indices (Table 2).
Table 2. Summary of genetic diversity parameters (Ho, He, MAF, and PIC) in Yam accessions from six West, Central, and East African countriesCountryHoHeMAFPICBenin0.2490.4510.2370.296CIV0.2640.4360.2360.295DRC0.2550.4450.2390.297Ghana0.2530.4470.2350.295Nigeria0.2580.4420.2460.302Uganda0.2550.4450.2290.289Average0.2560.4440.2370.296SD0.0050.0050.00510.0038SD standard deviation, Ho observed heterozygosity He expected heterozygosity, MAF minor allele frequency, PIC polymorphism information content, CIV Cote d’Ivoire, DRC Democratic Republic of Congo
At the species level, genetic diversity analysis across six Dioscorea species revealed moderate to high expected heterozygosity (He), with values ranging from 0.412 in D. bulbifera to 0.454 in D. cayenensis. Observed heterozygosity (Ho) varied slightly among species, with D. bulbifera showing the highest Ho (0.288), suggesting relatively higher outcrossing or genetic variation in that species. Minor allele frequency (MAF) was consistent across species (average = 0.236), indicating comparable allelic distributions. Polymorphic information content (PIC) values ranged from 0.354 to 0.363. Overall, D. rotundata,* D. praehensilis*, and D. dumetorum exhibited slightly higher marker informativeness, supporting their relevance in breeding and conservation efforts (Table 3).
Table 3. Summary of genetic diversity parameters(Ho, He, MAF, and PIC) across six Yam speciesSpeciesHoHeMAFPIC D. bulbifera0.2880.4120.2370.360 D. alata0.2660.4340.2300.354 D. cayenensis0.2460.4540.2370.360 D. dumetorum0.2550.4450.2370.362 D. praehensilis0.2530.4470.2360.361 D. rotundata0.2540.4460.2380.363 Average0.2590.4410.2360.360 SD0.01370.01370.00270.0029Ho observed heterozygosity, He expected heterozygosity, MAF minor allele frequency, PIC polymorphism information content
Population stratification and diversity assessment
Population structure was assessed using ADMIXTURE (with cross-validation error for optimal K), DAPC (with BIC for K), and an IBS-based Neighbor-Joining (NJ) tree, were employed to investigate the population structure of 1,247 yam accessions. Using the Bayesian Information Criteria (BIC), we notice K = 4 a rapid decline was observed suggesting the grouping of the entire population in 4 major groups (Fig. 4). Group 1 (red) included 180 accessions, primarily 165 of Dioscorea praehensilis from Ghana, with some from DRC and Uganda. Group 2 (blue) was made of 542 genotypes, mainly 524 D. rotundata. Group 3 (green) contained 194 accessions, mostly D. praehensilis and Group 4 (cyan) had 332 accessions made mainly D. alata (Figure. 4).
Fig. 4. Discriminant analysis of principal components (DAPC) using 7648 SNP markers. Each color represents a cluster (cluster1 is red, cluster 2 is blue, cluster 3 is green, cluster 4 is cyan, and each dot represents an individual. Numbers represent the different subpopulations identified by DAPC analysis
Using the ADMIXTURE, four subpopulations similarly to DAPC were identified (Fig. 5), A total of 968 genotypes were successfully assigned to four subpopulations with an ancestry probability ≥ 70%. The remaining 22.4% (279 genotypes) were classified as admixed withancestry probability < 70%. In group 1 (red), majority of the yam genotypes were.
D. rotundata collected from different countries. Group 2 (blue) included 214 accessions, composed of D. rotundata (115), with some D. cayenensis [6], and D. praehensilis (63). Group 3 (green) contained mainly genotypes of D. alata from Côte d’Ivoire. while Group 4 (cyan) had a total of 163 accessions of D. rotundata (105) and D. praehensilis [33]. The admixture accessions were breeding lines of both D. rotundata and D. alata with few accessions of D. praehensilis.
Fig. 5. Population structure with K = 4 for 1247 yam accessions into 4 clusters based on the SNP data using Bayesian-based clustering analysis. A vertical bar represents each accession. The coloured sections in a bar indicate the membership coefficients of the accessions in the different clusters. Identified subgroups are: cluster 1 (red), cluster 2 (bleu), cluster 3 (green), and cluster 4 (cyan)
To assess the evolutionary relationships among Dioscorea species, we constructed a Neighbor-Joining (NJ) phylogenetic tree based on genome-wide SNP data (Fig. 6). The tree clearly differentiates the accessions by species, with well-resolved clusters corresponding to D. rotundata,* D. alata*,* D. praehensilis*,* D. bulbifera*,* D. dumetorum*,* and D. cayenensis*. Dioscorea rotundata accessions (green) form a highly diverse clade, indicating moderate genetic diversity among both landraces and breeding lines. In contrast, D. alata (blue) also forms a distinct clade, though with slightly low internal diversity and several extended branches.
A third major cluster comprises D. praehensilis (purple), which shows a wide and deeply branched clade, suggesting substantial intra-specific genetic variation.
The remaining species D. cayenensis (orange), D. bulbifera (red), and D. dumetorum (cyan) occupy more peripheral positions on the tree and are clearly separated from the core clusters, reflecting their more divergent genomic backgrounds and supporting their classification as genetically distinct species within the genus.
Overall, the NJ tree topology supports the species classification derived from previous PCA and structure analyses and provides additional evidence of strong species-level divergence, particularly between cultivated and wild/semi-domesticated yam relatives.
Fig. 6. Unrooted Hierarchical cluster based on Jaccard genetic dissimilarity matrix of 1247 showing the grouping pattern of the Dioscorea species accessions, independent of the origin country
To gain deeper insights into the genetic relationships among yam accessions, we conducted a network analysis based on pairwise genetic distances and visualized the results using force-directed graphs. Two complementary network views were generated: one representing the country of origin (Fig. 7) and the other species identity (Fig. 8). In both networks, nodes represent individual genotypes, and edges denote genetic similarity above a defined threshold.
Figure 7 displays the network topology with nodes color-coded according to geographic origin, including Nigeria, Ghana, Benin, Côte d’Ivoire, DRC, and Uganda. The network structure reveals extensive interconnections across accessions from different countries, suggesting gene flow or the sharing of germplasm across national borders. A moderately cohesive subnetwork is observed toward the lower right of the graph, largely composed of accessions from Benin and Uganda, suggesting some country-specific structure. However, a broader, more diffuse cloud in the upper half of the network contains a mix of accessions from Ghana, Nigeria, Côte d’Ivoire, and DRC, indicating high genetic similarity among these groups.
In contrast, Fig. 8 shows the same network with nodes colored by species: D. rotundata, D. alata,* D. praehensilis*,* D. cayenensis*,* D. bulbifera*, and D. dumetorum. Here, a much clearer structuring pattern emerges. Three major clusters dominate the graph: a large, tightly connected cluster of Dioscorea rotundata accessions (red nodes); a second cluster containing mostly Dioscorea alata (blue) and Dioscorea praehensilis (cyan); and a sparse periphery consisting of more genetically distant species such as Dioscorea cayenensis,* Dioscorea dumetorum*, and Dioscorea bulbifera. This species-based organization aligns well with the ADMIXTURE, DAPC, and phylogeny tree and confirms that species identity, rather than geography, is the primary driver of genetic structure in the yam diversity panel analyzed.
Fig. 7. Network-based visualization of genetic relatedness among yam genotypes from Nigeria, Ghana, Benin, Côte d’Ivoire, the Democratic Republic of Congo, and Uganda
Fig. 8. Network-based visualization of genetic relatedness among the six yam species
The AMOVA results provided further insights into genetic differentiation across countries and species (Tables 4 and 5). Among-country, an average fixation index (Fst) of 0.07, indicating moderate differentiation (Table 4). The closest genetic relationship was between Ghana and Nigeria (Fst = 0.08) (Table 4), while the highest genetic distance was observed between Uganda and DRC (Fst = 0.16) (Table 4). Within-species variation showed moderate differentiation (average Fst = 0.14) (Table 5), with the highest variation observed in Dioscorea rotundata (Fst = 0.18) and the lowest in D. cayenensis (Fst = 0.04) (Table 5). Between-species Fst values ranged from 0.10 to 0.28, with the closest relationship was between D. praehensilis and D. cayenensis (Fst = 0.10), and the highest divergence was between D. rotundata and D. alata (Fst = 0.28). Cluster analysis grouped the six yam species into two major clusters, highlighting the genetic proximity of D. rotundata and D. praehensilis and the distinctiveness of D. dumetorum (Fig. 9).
Table 4. Fixation index (Fst) based on countries of originCountryBeninCIVDRCGhanaNigeriaUganda Benin0.02 CIV0.10.04 DRC0.050.060.07 Ghana0.060.150.110.06 Nigeria0.060.080.090.080.08 Uganda0.040.130.160.030.070.01 Average0.0550.0920.1080.0570.0750.01The diagonal value represent the diversity index within country. DRC, Democratic Republic of Congo; CIV, Côte d’Ivoire.
Table 5. Fixation index (Fst) based on Yam speciesSpeciesTDrTDaTDcTDpTDdTDb TDr0.18 TDa0.280.12 TDc0.120.220.04 TDp0.150.270.100.14 TDd0.220.190.230.210.08 TDb0.210.110.300.170.110.02 Average0.1930.1820.1680.1730.0950.02TDr, Dioscorea rotundata; TDa, D. alata; TDc, D. cayenensis; TDp, D. praehensilis; TDd, *D. dumetorum;*TDb, D. bulbifera
Fig. 9. Hierarchical Dendogram-based on Dioscorea species evolutionary (D is for Dioscorea)
Discussion
This study presents one of the most comprehensive assessments of genetic diversity in yam (Dioscorea spp.) conducted in sub-Saharan Africa, comprising 1,247 accessions across six countries and six yam species, including both cultivated and wild related. Using 7,648 (https://figshare.com/account/items/29314031/edit) SNP markers, we generated a robust genomic framework to elucidate the evolutionary relationships, population structure, and genetic differentiation within African yam germplasm.
Diversity estimates based on SNP data revealed moderate polymorphism levels across species. The average observed heterozygosity (Ho = 0.259) was consistently lower than the expected heterozygosity (He = 0.441), suggesting a degree of excess homozygosity. This is consistent with the clonally propagated nature of yam, limited recombination, and potential sub-structuring within population patterns previously reported in clonally propagated crops [12, 19, 44]. The slightly higher observed heterozygosity (Ho = 0.288) for D. bulbifera in this study compared with the value reported by [23] (Ho = 0.262) could be attributed to differences in germplasm composition, marker density, and population structure. The inclusion of accessions from multiple geographic regions and the use of high-resolution SNP markers may have contributed to the broader allelic representation and increased heterozygosity observed. Expected heterozygosity (He) ranged from 0.262 to 0.277, with D. cayenensis showing the highest value (He = 0.454), indicating species-specific variation in allele richness and genetic diversity [18, 45, 46].
These diversity estimates are in agreement with previous reports based on SSR and SNP marker analyses [7, 17, 19]. The observed stability in MAF (0.236) and PIC (0.360), with the Nigerian population and D. rotundata exhibiting the highest PIC values (0.246 and 0.363, respectively), underscores the high informativeness and discriminatory power of the SNP panel. This further highlights its suitability for genetic diversity assessment and breeding applications in Dioscorea species [24]. The moderate disparity between expected (He) and observed (Ho) heterozygosity suggests a reduction in genetic variability within the population. This pattern may reflect historical bottlenecks, domestication pressures, and allele fixation events associated with clonal propagation and recurrent selection processes previously reported during the domestication and genetic refinement of Dioscorea rotundata [12, 13, 47].
Population structure analysis identified four well-resolved genetic clusters using ADMIXTURE and DAPC, with a proportion of admixed individuals indicative of historical gene flow and interspecific introgression [24, 38]. These patterns are consistent with traditional farmer practices involving clonal propagation and exchange of diverse landraces, which promote population connectivity [48]. Phylogenetic overlap between D. rotundata and D. praehensilis supports the hypothesisof D. rotundata been originated from a long domestication process from wild progenitors.Scarcelli et al., (2006) and Sugihara et al., (2020) highlighted the value of D. praehensilis as a reservoir of favorable alleles for traits such as disease resistance and yield potential [13, 21].
In contrast, Dioscorea alata showed relatively low intra-species variability, likely due to its introduction from the Pacific with a narrow founder base, followed by clonal expansion and limited recombination in Africa [49, 50]. This supports the need to broaden its genetic base through introgression and targeted diversity enrichment.
AMOVA revealed moderate differentiation among countries (Fst = 0.07) and higher differentiation among species (Fst = 0.14), in agreement with [38, 51]. The highest interspecific divergence (Fst = 0.28) between D. rotundata and D. alata reflects their divergent origins, while the lowest Fst (0.10) between D. praehensilis and D. cayenensis may suggest shared ancestry or convergent evolution under similar selection pressures. These findings reinforce the importance of conserving both wild and cultivated species to ensure long-term breeding gains [2].
From a breeding perspective, the distinct genetic clusters and country-level stratification provide a valuable framework for parental line selection. Moreover, this study demonstrates the utility of DArTseq and whole-genome resequencing platforms for high-resolution diversity analysis in yam, a crop often challenged by its polyploidy and clonal nature [52]. The consistency across ADMIXTURE, DAPC, and phylogenetic clustering confirms the reliability of the dataset and its suitability for core collection development and pre-breeding pipelines.
Conclusion
In this study, we generated an informative dataset across Africa to reveal the genetic diversity of yam. The study offers important insights into the genetic architecture and population dynamics of yam in Africa. The identified diversity patterns and population structure form a valuable foundation for breeding strategies, germplasm conservation, and trait mapping. Considering the complex domestication history, widespread informal seed systems, and broad agroecological distribution of yam, future research should prioritize the inclusion of geographically representative and genetically diverse accessions. Such efforts will strengthen genomic-assisted breeding, enhance trait discovery, and ensure the long-term conservation and utilization of Africa’s yam genetic resources.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mawoneke KG, Kwiri R, Ndemera M. A concise review of yam (Dioscorea spp.) starch: extraction, chemical composition, physicochemical properties and its potential food applications. Cogent Food Agric [Internet]. 2025;11(1). Available from: 10.1080/23311932.2024.2447900
- 2Worojie TB, Asfaw BT, Mengesha WA. Cultivation and possible domestication of feral and possibly wild yams (Dioscorea spp.) in Southwest Ethiopia: ethnobotanical and morphological evidence. Plant Signal Behav [Internet]. 2021;16(5). Available from: 10.1080/15592324.2021.187953110.1080/15592324.2021.1879531 PMC 807850333678151 · doi ↗ · pubmed ↗
- 3Amponsah Adjei E, Esuma W, Alicai T, Bhattacharjee R, Dramadri IO, Edema R, et al. Genetic diversity and population structure of uganda’s Yam (Dioscorea spp.) genetic resource based on D Ar Tseq. P Lo S ONE. 2023;18(2):18–29. 10.1371/journal.pone.0277537 PMC 992806636787288 · doi ↗ · pubmed ↗
- 4Mekbib F, Hussein S, Gebre E, Agricultural J, Africa S, Ababa A. Genetic Variability, correlation and path analysis on the storage tuber yield and yield components of Yam (Dioscorea spp.) from Southwest Ethiopia. Genet Resour Crop Evol. 2003;4(1):98–118.
- 5Epping J, Laibach N. An underutilized orphan tuber crop—Chinese yam: a review. Planta [Internet]. 2020;252(4):1–19. Available from: 10.1007/s 00425-020-03458-310.1007/s 00425-020-03458-3PMC 750582632959173 · doi ↗ · pubmed ↗
- 6Asfaw BT, Worojie TB, Mengesha WA. Assessing morphological diversity in Ethiopian yams (Dioscorea spp.) and its correspondence with folk taxonomy. Syst Biodivers [Internet]. 2021;19(5):471–87. Available from: 10.1080/14772000.2021.1890269
- 7Adejumobi II, Agre PA, Onautshu DO, Adheka JG, Cipriano IM, Monzenga JCL, et al. Assessment of the Yam landraces (Dioscorea spp.) of DR congo for reactions to pathological Diseases, yield Potential, and tuber quality characteristics. Agric. 2022;12(5):14–21.
- 8Adewumi AS, Asare PA, Akintayo OT, Adejumobi II, Adu MO, Taah KJ et al. Genetic architecture of post-harvest tuber quality traits in Bush Yam (Dioscorea praehensilis Benth.) germplasm through association mapping. Front Hortic. 2024;3(4):24–32.
