# Whole-genome sequencing reveals genetic diversity, population structure, and core collection construction in Korean peach (Prunus persica) germplasm

**Authors:** Seon-Hwa Bae, Namhee Jeong, Jung Hyun Kwon, Ju-Hyun Lee, Kidong Hwang, Youn Young Hur, So Jin Lee

PMC · DOI: 10.3389/fpls.2025.1702527 · 2025-11-06

## TL;DR

This study applied whole-genome sequencing to 445 peach accessions conserved in Korea, characterizing their genetic diversity and population structure and establishing a representative core collection for future research and breeding.

## Contribution

The study constructs the first genome-scale core collection for Korean peach germplasm using whole-genome sequencing.

## Key findings

- A total of 944,670 high-confidence SNPs were identified, with chromosomes 2 (G2) and 4 (G4) showing the highest variant density.
- A representative core collection was developed, capturing most of the genetic diversity in Korean peach germplasm.
- fastSTRUCTURE, principal component analysis (PCA), and phylogenetic reconstruction revealed a complex population structure and substantial genetic variation among the accessions.
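
The variant-density finding above can be illustrated with a minimal sketch, assuming density is measured as SNPs per megabase of chromosome length (the summary does not state the exact definition used). The function name `snp_density_per_mb`, the chromosome lengths, and the SNP counts are hypothetical toy values, not data from the study.

```python
from collections import Counter


def snp_density_per_mb(snp_chroms, chrom_lengths_bp):
    """Return SNPs per megabase for each chromosome.

    snp_chroms: iterable with one chromosome label per SNP.
    chrom_lengths_bp: dict mapping chromosome label -> length in base pairs.
    """
    counts = Counter(snp_chroms)
    return {
        chrom: counts.get(chrom, 0) / (length_bp / 1_000_000)
        for chrom, length_bp in chrom_lengths_bp.items()
    }


# Toy inputs invented for illustration (NOT values from the paper).
toy_lengths = {"G1": 4_000_000, "G2": 2_000_000, "G3": 1_000_000}
toy_snps = ["G1"] * 8 + ["G2"] * 10 + ["G3"] * 3

density = snp_density_per_mb(toy_snps, toy_lengths)
# Rank chromosomes from highest to lowest density.
ranked = sorted(density, key=density.get, reverse=True)
```

Normalizing by length matters because a long chromosome can carry the most SNPs in absolute terms while still having a lower density than a shorter one, as `G1` does in the toy data.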

## Abstract

Peach (Prunus persica) is an important temperate fruit crop and a model species for genomic research due to its diploid genome, short juvenile period, and relatively small genome size. Despite advances in next-generation sequencing (NGS), most peach genome-wide studies have focused on a limited number of elite cultivars, and thus, the diversity of conserved germplasm is underrepresented. In Korea, a large number of peach genetic resources are maintained at the National Institute of Horticultural and Herbal Science (NIHHS), a branch of the Rural Development Administration (RDA), but no genome-scale core collection has been developed to date. This study aimed to perform whole-genome sequencing (WGS) on 445 peach accessions conserved in Korea between 2020 and 2025 using the Illumina NovaSeq 6000 platform, with the primary objective of constructing a representative genome-scale core collection and secondary objectives of identifying genome-wide single-nucleotide polymorphisms (SNPs) and assessing genetic diversity, population structure, and phylogenetic relationships. A total of 944,670 high-confidence SNPs were identified, with chromosomes 2 (G2) and 4 (G4) showing the highest variant density. Analyses using fastSTRUCTURE, principal component analysis (PCA), and phylogenetic reconstruction revealed a complex population structure and substantial genetic variation. From these data, a representative core collection was established, effectively capturing the majority of the genetic diversity present in the Korean peach germplasm. These results offer valuable genomic resources for peach improvement, marker development, pan-genome construction, and comparative genomics within the Rosaceae family.
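
The abstract does not say which algorithm was used to select the core collection. As a rough illustration of the general idea, the sketch below uses a greedy allele-coverage heuristic: repeatedly add the accession that contributes the most alleles not yet represented. This is an assumed stand-in technique, not the authors' method, and the names `alleles_of`, `greedy_core`, and the toy accessions are hypothetical.

```python
def alleles_of(dosages):
    """Convert alt-allele dosages (0, 1, 2) into a set of (snp_index, allele) pairs."""
    observed = set()
    for i, dosage in enumerate(dosages):
        if dosage in (0, 1):
            observed.add((i, "ref"))  # homozygous ref or heterozygous
        if dosage in (1, 2):
            observed.add((i, "alt"))  # heterozygous or homozygous alt
    return observed


def greedy_core(genotypes, target=1.0):
    """Greedily pick accessions until `target` fraction of all alleles is covered.

    genotypes: dict mapping accession name -> list of alt-allele dosages per SNP.
    Returns (list of selected accessions, fraction of alleles covered).
    """
    allele_sets = {name: alleles_of(d) for name, d in genotypes.items()}
    all_alleles = set().union(*allele_sets.values())
    if not all_alleles:
        return [], 0.0

    covered, core = set(), []
    while len(covered) < target * len(all_alleles):
        # Ties are broken alphabetically so the result is deterministic.
        best = max(sorted(allele_sets), key=lambda n: len(allele_sets[n] - covered))
        if not allele_sets[best] - covered:
            break  # no remaining accession adds a new allele
        core.append(best)
        covered |= allele_sets[best]
    return core, len(covered) / len(all_alleles)


# Toy genotype matrix invented for illustration (NOT data from the study).
toy = {
    "acc_A": [0, 0, 1, 2],
    "acc_B": [0, 0, 1, 2],  # genetically redundant with acc_A
    "acc_C": [2, 0, 0, 0],
    "acc_D": [0, 1, 0, 0],
}
core, coverage = greedy_core(toy)
```

In the toy data, the redundant `acc_B` is never selected because it adds no allele that `acc_A` has not already contributed, which is the behavior a core collection is meant to achieve: a smaller set of accessions that retains most of the diversity of the full germplasm.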

## Linked entities

- **Species:** Prunus persica (taxon 3760)

## Full-text entities

- **Species:** Prunus persica (peach, species) [taxon 3760]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12631232/full.md

---
Source: https://tomesphere.com/paper/PMC12631232