Direct fluorescent S-genotyping reveals genetic diversity and pedigree inconsistencies in red-fleshed apple hybrids and American heritage varieties
Gergő Szendy, Dávid Polgári, Magdolna Tóth, Attila Hegedűs, Júlia Halász

TL;DR
This study improves apple breeding by identifying genetic diversity and pedigree issues in red-fleshed apple hybrids using advanced genotyping methods.
Contribution
A PCR-only S-genotyping method was optimized for precise allele identification in red-fleshed apple hybrids.
Findings
Sixteen distinct S-alleles were identified in red-fleshed apple hybrids.
Inconsistencies in pedigree records were revealed through molecular validation.
New hybrids show unique S-allele frequencies, offering potential for broader genetic diversity in breeding.
Abstract
Main conclusion A PCR-only S-genotyping was optimized for red-fleshed apple hybrids carrying the R6 allele of MdMYB10. Sixteen distinct S-alleles were identified. New hybrids show unique S-allele frequencies, enhancing breeding diversity. Red-fleshed apples are increasingly valued for their esthetic appeal and potential health benefits, making them attractive targets in modern breeding programs. In this study, 80 novel red-fleshed and six white-fleshed apple hybrids, along with their parental cultivars, were genotyped for the R6 allele of the MdMYB10 gene and for their S-alleles to assess pollination compatibility. All red-fleshed hybrids carried the R6 allele, confirming their type I red-fleshed phenotype. A previously published high-throughput S-genotyping protocol was optimized to achieve single base-pair accuracy, enabling a streamlined, PCR-only workflow without the need for…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3- —http://dx.doi.org/10.13039/100020610Magyar Agrár- és Élettudományi Egyetem
- —Hungarian University of Agriculture and Life Sciences
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant Gene Expression Analysis · Postharvest Quality and Shelf Life Management · Plant Reproductive Biology
Introduction
The apple is the third most popular fruit cultivated worldwide behind watermelons and bananas (FAOSTAT 2023). Besides fresh consumption, the fruit can be processed into juice, cider, purees, chips, and beauty products (Matsumoto 2014). While over 30,000 apple varieties are known, some 7,500 of these named and many cultivars are released every year (Volk et al. 2015; Sheick et al. 2020; Górnaś et al. 2022; Szot et al. 2022;). Produce of the competitors and changing demand from customers continuously compel growers to plant new cultivars in order to stay ahead of this curve (Legun 2015).
With annual sales worth billions of dollars, the apple industry relies on consistent yields, which fundamentally depend on successful fertilization. Although some accessions display parthenocarpy (Wu et al. 2024), commercially grown diploid cultivars are self-incompatible and require a pollinizer cultivar for cross-pollination to ensure economically adequate fruit set and, ultimately, profitability (Crane and Lawrence 1928; Ramírez and Davenport 2013). Sexual incompatibility between the cultivars planted in an orchard can lead to inadequate fruit set (Jackson 2003) while partial compatibility can result in less than desirable fruit set and deformed fruits due to inadequate seed set (De Witte et al. 1995; Goldway et al. 1999; Long et al. 2010).
Compatibility studies initially consisted of phenotypic incompatibility test (Kobel et al. 1939) and controlled pollination experiments with pollen tube growth tests (Modlibowska 1945) to infer the incompatibility alleles while modern methods use allele-specific PCR (Broothaerts 2003; Long et al. 2010; De Franceschi et al. 2018; Abdallah et al. 2024), CAPS (Kim et al. 2009), PCR with universal primers (Larsen et al. 2016), qPCR (López-Girona et al. 2021), and mini-sequencing (Kasajima et al. 2017), or a combination of these approaches in order to provide accurate results. Apple possesses a gametophytic self-incompatibility system, like all other plants in Rosaceae, which operates on a non-self-recognition basis. The multi-allelic S-locus encodes self-incompatibility ribonucleases (S-RNases), which are produced in the style, and a group of F-box proteins (S-locus F-box Brothers, SFBBs), which accumulate in the pollen (Sassa et al. 2007). According to the current model of non-self-recognition, each SFB protein recognizes a specific set of non-self S-RNases and marks them for proteolytic degradation through the ubiquitin–proteasome pathway (Kubo et al. 2010; De Franceschi et al. 2018; Pratas et al. 2018). Due to the large number of SFBB genes in a single haplotype, S-genotyping methods focus on identifying the S-RNase genes in apple (López-Girona et al. 2021). The discovery of new S-alleles in both cultivated and wild apple genotypes along with the necessity of incorporating them into compatibility testing underscores the need for continuous methodological development (De Franceschi et al. 2018).
S-alleles are of paramount importance during breeding as well. Not only accessions with matching S-genotypes cannot be crossed with each other, if an economically important trait is closely linked to matching S-alleles then even semi-compatible crosses could become impractical (Umemura et al. 2013). S-alleles are also used when screening progenies to confirm the parental cultivars (Sakurai et al. 2000; Long et al. 2010) and it is also an effective tool to distinguish between accessions with identical names or very similar phenotypes in gene banks and germplasm collections (Halász et al. 2011).
Red-fleshed apples present a novel possibility for both breeders and growers. These apples not only have a striking appearance but can be also considered functional foods with stronger health benefits than regular apples (van Nocker et al. 2012; Wang et al. 2015). The red flesh can be the result of two distinct mutations; these are referred to as types I and II (Volz et al. 2007). Flesh color development in type I red-fleshed apples is controlled by the upregulation of the MdMYB10 transcription factor (Espley et al. 2009); its expression level has been shown to directly correlate with the anthocyanin content in plant tissues (Chagné et al. 2013). The presence of a single repeat (R_1_) unit of 23 bp in the promoter of the MdMYB10 gene results in a white-fleshed phenotype, while the insertion of five additional tandem repeats (R_6_) enhances the ability of MdMYB10 to bind to its own promoter region and autoregulate its own expression. This mutant allele with six repeats confers the red-fleshed phenotype. Actual level of anthocyanin accumulation and thus flesh coloration differs among R_6_ red-fleshed plants and is heavily influenced by the activity of other transcription factors (Wang et al. 2018, 2024; Jiang et al. 2019).
This study aimed to screen 80 new red-fleshed and six white-fleshed hybrids, along with their parental cultivars, for the R_6_ allele of the MdMYB10 gene. This allele is associated with increased anthocyanin accumulation. This study aimed to determine the S-genotype of the hybrids to evaluate their pollination compatibility with commercial cultivars and to optimize a previously published high-throughput S-genotyping method by improving its accuracy to single-base pair resolution, enabling a PCR-only workflow and reducing the number of required reactions.
Materials and methods
Plant material
In total, 107 apple cultivars and genotypes (80 new red-fleshed hybrids, six white-fleshed hybrids, five red-fleshed and six white-fleshed parental cultivars and ten American heritage apples) were sampled. Plant material of the new hybrids and parental cultivars were provided by a breeding program of Almakúti Ltd. (Zalaszántó, Hungary). The American heritage apples were sampled in a private orchard (Budapest, Hungary). It should be noted that the cultivar referred to as ʽPilot’ in this article is the American heirloom variety discovered in the early 1800 s (Downing and Downing 1872) and not the contemporary variety also called ʽPilot’, released by the Institut für Obstforschung Dresden-Pillnitz in 1988 (Fischer and Fischer 2004). In all cases, young leaves were collected 3–4 weeks after bud break in the spring and stored at − 20 °C until DNA extraction.
DNA extraction and PCR analysis
Genomic DNA was extracted from healthy young leaves using DNeasy Plant Mini Kit (Qiagen, Hilden, Germany) and E.Z.N.A.^®^SP Plant DNA Kit (Omega Bio-tek, Norcross, GA, USA) according to the manufacturers’ specifications. To amplify Salleles, the following consensus primers were used: ASPF3-F (Kim et al. 2006), EIIWPN-R, S3/S5/S10-R, S16-R and S25-R together with the S8 allele-specific S8F and S8R primer pair (Larsen et al. 2016). The results were validated using 13 allele-specific primer pairs: FTC168-FTC169, OWB122-OWB123, FTC141-FTC142, FTC143-FTC144, FTC154-FTC155, FTC231-FCT232, FTC229-FTC230, FTC177-FTC226, FTC10-FTC11, FTC12-FTC228 (Broothaerts 2003), MdS_11_SpF-MdS_11_SpR, MdS_21_SpF-MdS_21_SpR, and MdS_44_SpF-MdS_44_SpR (Long et al. 2010). Detailed information on all primers used in this study is provided in Supplementary Table S1. For the analysis of MdMYB10 alleles, the primers described by Espley et al. (2009) (GGAGGGGAATGAAGAAGAGG and TCCACAGAAGCAAACACTGAC) were used following the published protocol.
Reactions using DreamTaq™ polymerase (Thermo Fisher Scientific, Waltham, MA, USA) were performed in a total volume of 20 µl, containing: 40–80 ng DNA sample, 10 × DreamTaq™ Green buffer; 1.2 mM MgCl_2_, 0.15 mM dNTP, 0.4 μM forward and reverse primers, 0.65 U DreamTaq™ DNA polymerase, 3% DMSO, and 0.01 μg/μl BSA. Reactions using Phire Hot Start II polymerase (Thermo Fisher Scientific, Waltham, MA, USA) were performed in a total volume of 15 µl, containing: 10 µg of DNA, 5 × buffer, 200 µM of each dNTP, 0.5 µM of both primers, 0.3 µl of Phire Hot Start II DNA Polymerase, 3% DMSO, and sterile distilled water to complete the volume. Reactions using either Geneaid™ Hot Start Taq Master Mix with Dye (Geneaid Biotech Ltd, Taiwan) or Canvax HotBegan^™^ Hot Start Green Taq Master Mix (Canvax Reagents SL, Valladolid, Spain) were carried out according to the manufacturers’ recommended protocols. For fragment length detection, the ASPF3-F forward primer was labeled with the FAM fluorescent probe. Reactions were carried out in Swift Max Pro (ESCO Healthcare, Singapore) and Genesy 96 T (Tianlong, Xi’an, China) thermocyclers. PCR cycling conditions for each primer pair are summarized in Supplementary Table S2.
Fragment length determination
PCR products were first visualized on a 1% TBE agarose gel (15–20 min at 80 V) under UV light after staining with ethidium bromide. For accurate size determination, the forward primer ASPF3-F was labeled with FAM fluorescent dye for the ABI PRISM 3100 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) automated DNA sequencer. The results obtained were evaluated using the Peak Scanner 3.0 software (Applied Biosystems Sizing Analysis Module) with a GS500 LIZ (Applied Biosystems) size standard.
Cloning and sequencing
Cloning of PCR products was carried out using the pTZ57R/T vector (Thermo Fisher Scientific, Waltham, MA, USA). The ligated plasmid vectors were transformed into JM109 Escherichia coli competent cells (Zymo Research, Irvine, CA, USA). The size of the cloned sequences was determined on agarose gels using M13 primers. The plasmid DNA fragments were purified using the EZ-10 Spin Column Plasmid DNA kit (Bio Basic, Markham, ON, Canada) and then sequenced at least three repetitions in the forward direction using the M13 sequencing primers in an ABI PRISM 3100 Genetic Analyzer (Thermo Fisher Scientific, Waltham, MA, USA) automated sequencer. Homology searches were carried out using the Basic Local Alignment Search Tool (BLAST) provided by the National Center for Biotechnology Information (NCBI, Bethesda, MD, USA) (Altschul et al. 1990).
Ploidy analysis
The ploidy levels for the American heritage apples were determined by flow cytometry using a Sysmex CyFlow Space Flow Cytometer (Sysmex Partec GMBH, Görlitz, Germany) equipped with a 365 nm wavelength UV LED diode. For nuclei isolation and staining, because of the high polyphenol content of apple tree leaves, a one-step CyStain UV OxProtect kit (Sysmex Partec GMBH, Görlitz, Germany) was used according to the manufacturer’s recommended protocol. Young leaves were freshly collected for the measurements, and approximately 1 cm^2^ of fresh leaf segments were chopped for one minute with a sharp razor blade in a 50 mm diameter plastic Petri dish after adding 500 µL of CyStain UV OxProtect Staining Solution (Sysmex Partec GMBH, Görlitz, Germany). The shredded leaves were incubated for one minute in the dish, and then the liquid fraction was transferred to the sample tube through a 50 µm CellTrics filter (Sysmex Partec GMBH, Görlitz, Germany). The suspension was washed into the sample tube with an additional 2 mL of CyStain UV OxProtect Staining Solution. After a five-minute incubation, the DNA content (> 10.000 nuclei/sample) was measured using UV LED diode illumination. Relative genome size was determined using Flomax 2.11 software (QA GmbH, Münster, Germany). The G_0_/G_1_ and G_2_/M peaks were determined by peak analysis method. Ploidy of ʽWaltana’, ʽPink Parfait’ and the other heritage apples were assessed by comparing the measured G_0_/G_1_ genome sizes to the diploid ʽKing David’ and the triploid ʽJonagold’ control samples.
Evaluation of fruit flesh color intensity
The intensity of red coloration in apple fruit flesh was visually assessed using at least five representative fruits per accession and scored on a scale from 1 to 5, where 1 = faint pinkish tinge, 2 = moderate pink, 3 = pink-red, 4 = moderate red, and 5 = dark red; characteristic examples are shown in Fig. 1.Fig. 1. Classes of visually assessed red-flesh color intensity in apple, scored on a scale of 1 (faint pinkish tinge), 2 (moderate pink), 3 (pink-red), 4 (moderate red), and 5 (dark red)
Results
S-allele detection of hybrids and cultivars
PCR amplification of the 86 new hybrids and the 11 parental cultivars using the ASPF3-F forward primer in combination with the EIIWPN-R reverse primer produced 11 distinct fragments smaller than 545 bp, and three additional fragments with the S3/S5/S10-R reverse primer (Table 1). In addition, ASPF3-F and EIIWPN-R amplicons larger than 1 kbp were also observed on agarose gels, corresponding to the S3-, S5-, and S10-alleles. These results were confirmed using the S3/S5/S10-R reverse primer, which specifically amplifies these alleles and yields distinct fragment sizes for each. The amplicons below 1 kbp were analyzed with capillary electrophoresis; all peaks were high, distinct, and appeared without stuttering for each sample (Table 1). The closest sizes were observed with the hybrid X149 in which the S1-and S24 -alleles resulted in fragment lengths of 545 bp and 540 bp, respectively, which were clearly visible as two distinct peaks on the electropherogram. The S16-allele was amplified in 12 red-fleshed hybrids and one new white-fleshed cultivar (ʽIsolda’) using the ASPF3-F forward and the S16-R reverse primers; the 447 bp size corresponded to the S16b-allele in all cases. The ASPF3-F forward and S25-R primer pairs did not amplify products in any of the samples; the S8-allele was detected only in one red-fleshed hybrid (X303) and one white-fleshed cultivar (ʽBellona’) using the allele-specific S8-F and S8-R primer pair. One allele amplified from Malus purpurea ʽRoyalty’ was detected as a 475 bp fragment that corresponds to the S31-allele. Table 1. Detection of S-alleles using fragment size analysis determined by a capillary sequencer and PCRs with allele-specific primersS-alleleSize (bp, in this study)Size (bp, Larsen et al. 2016)Allele-specific primersSequenced cultivar or hybridAccession numberASPF3-F and EIIWPN-RS_1_544559–562FTC168FTC169ʽWaltana’PV693751S_2_350370–372OWB122OWB123––S_6_369390–392FTC141FTC142––S_7_320340–342FTC143FTC144––S_9_349367–369FTC154FTC155T184PV693744S_17_369390–392FTC141FTC142ʽPilot’, ʽPink Parfait’, ʽRubaiyat’, ʽWaltana’PV693747, PV693748, PV693749, PV693746S_20_516535–537FTC141FTC142––S_24_540556–558FTC231FTC232––S_28_371392–394FTC229FTC230––S_31_475496–497–Malus purpurea ʽRoyalty’PV693752S_44_375396 (expected)MdS_44_SpFMdS_44_SpR––ASPF3-F and S3/S5/S10-RS_3_400425–427FTC177FTC226––S_5_378401–403FTC10FTC11ʽKatherine’PV693750S_10_360382–383FTC12FTC228ʽWaltana’PV693745ASPF3-F and S16-RS_16b_447469–470–––
S-alleles for 10 American heritage apple varieties were also determined (Table 2). PCR reactions amplified two S alleles in ʽChestnut Crab’, ʽIngram’, ʽKatherine’, ʽKing David’, ʽMuscat de Venus’, ʽPilot’, ʽRubaiyat’ as well as ʽWickson Crab’ in using the ASPF3-F forward and EIIWPN-R and S3/S5/S10-R reverse primers, while in ʽPink Parfait’ (S2S3S17) and ʽWaltana’ (S1S10S17) three S-alleles were detected. Table 2S-allele constitution of American heritage varietiesVarietyS-alleleVerification methodS-alleleVerification methodS-alleleVerification methodChestnut CrabS_9_FTC154 + FTC155 SequencingS_20_FTC141 + FTC142––IngramS_1_FTC168 + FTC169S_9_FTC154 + FTC155––KatherineS_5_FTC10 + FTC11, SequencingS_7_FTC143 + FTC144––King DavidS_1_FTC168 + FTC169S_24_FTC231 + FTC232––Muscat de VenusS_5_FTC10 + FTC11S_24_FTC231 + FTC232––PilotS_7_FTC143 + FTC144S_17_FTC141 + FTC142 Sequencing––Pink ParfaitS_2_OWB122 + OWB123S_3_FTC177 + FTC226S_17_FTC141 + FTC142 SequencingRubaiyatS_3_FTC177 + FTC226S_17_FTC141 + FTC142 Sequencing––WaltanaS_1_FTC168 + FTC169 SequencingS_17_FTC141 + FTC142 SequencingS_10_FTC12 + FTC228 SequencingWicksonS_3_FTC177 + FTC226S_5_FTC10 + FTC11––
Verification of the results
Since the resulting fragments lacked the M13 tail used for fluorescent labeling by Larsen et al. (2016), the amplicon sizes were not directly comparable between the studies. For verification allele-specific primers (Broothaerts 2003; Long et al. 2010) were utilized for the S1–S3,* S5, S*7, S9, S10, S6/17/20, S24, S28, and S44 alleles. Amplification efficiency varied among polymerases. The S7 and S9 allele-specific products (FTC143–FTC144 and FTC154–FT155 primers) amplified weakly with Taq polymerase, whereas both alleles were strongly amplified using hot-start polymerase enzymes.
The FTC177 and FTC226 primers (Broothaerts 2003) amplifying the S3-allele did not work with the (C) PCR protocol, but using the modified (A) protocol detailed in Materials and Methods produced strong bands with light smearing in lanes where the S3 allele was present. The S10 allele-specific primers amplified not only the S10 but also the S3-allele, although somewhat weaker than the S3 allele-specific primers did. Therefore, the confirmation of the S10-allele required two PCR reactions with both the S3 and S10 allele-specific primers.
To verify the results for the heritage apple PCR products, allele-specific primers (Broothaerts 2003) were used, and nine amplicons were also cloned and sequenced (deposited in GenBank under accession numbers PV693744–PV693752) to confirm all detected alleles. Using PCR products from the ASPF3-F and EIIWPN-R reaction, the S17-allele was cloned from ʽPilot’, ʽPink Parfait’, ʽRubaiyat’ and ʽWaltana’ (PV693747, PV693748, PV693749 and PV693746, respectively). The fragment from ʽWaltana’ was 97.3% identical to the GenBank S17 sequence (MG598499) with 98.5% matching amino acids in the translated proteins while the other three fragments were 100% identical to the GenBank S17 sequence. In addition, the S1 (PV693751) and S10 (PV693745) alleles were cloned and sequenced from ʽWaltana’ and proved to be identical to the GenBank S1 (MG598487) and S10 (MG598496) sequences, respectively.
The presence of the S1, S2, S3, S5, S7, S9, S10, S17, S20, and S24-alleles in the heritage apples all confirmed using allele-specific primers (Broothaerts 2003), as well. Since the allele-specific primers FTC10 and FTC11 amplified the S5-allele unreliably, the amplicon from ʽKathreine’ using the ASPF3-F and S3/S5/S10-R primer pair was cloned and sequenced (PV693750) to confirm the identity of the allele; the fragment showed significant homology (E value: 5e-169) to the database apple S5-RNases with a single synonymous base substitution.
The S9- and S46-alleles (Long et al. 2010) are also very similar to each other and the sizes of the fragments amplified from these alleles using the consensus primers differ only by 1 bp. Therefore, the S9 allele-specific PCR products were sequenced to determine the specific alleles present in ʽChestnut Crab’, T45 and T184 (PV693744). In all three cases, the sequences were identical to the GenBank S9 sequence (MG598495). The S31-allele was cloned and sequenced from Malus × purpurea ʽRoyalty’ (PV693752) and 99.8% identical to the S31* Malus domestica* allele (DQ135990) with no amino acid replacements in the coded proteins.
The fragment lengths for ʽKing David’ using the ASPF3-F and EIIWPN-R primers indicated an S genotype of S1S24 that is further confirmed by negative S9 allele-specific and positive S24 allele-specific PCR results, which is incongruent with the commonly accepted S1S9 genotype for this variety (Morita et al. 2009).
Ploidy levels
Because two heritage apples (ʽPink Parfait’ and ʽWaltana’) were shown to carry three S-alleles, their ploidy levels were determined using flow cytometry. The G_0_/G_1_ nuclei of the diploid control (ʽKing David’) showed a relative genome size of 71.86 with a CV value of 9.11. A smaller G_2_/M peak was also detected at X = 144.47 with a CV of 6.07. For the triploid control (ʽJonagold’), the G_0_/G_1_ nuclei measured 102.54 with a CV of 7.64, while the G_2_/M peak measured 203.42 with a CV of 5.88. These measurements confirmed that the genome size of the well-known triploid ʽJonagold’ is approximately one and a half times (143%) the genome size of the diploid ‘King David’ sample.
For ʽWaltana’, the G_0_/G_1_ peak was detected at 72.5 with a CV of 6.82, and the G_2_/M peak at 145.75 with a CV of 3.97. For ʽPink Parfait’, the G_0_/G_1_ peak was detected at 69.72 with a CV of 7.28, and the G_2_/M peak at 144.38 with a CV of 4.37. These values (101.88% and 97.98% relative to the diploid control) correspond to a diploid genome size, with deviations within 3% in both cases (Fig. 2).Fig. 2. Histograms show the relative genome size of three different genotypes as determined by flow cytometry. The G_0_/G_1_ peak of the ‘Waltana’ genotype was detected at X = 71.65, which corresponds to the diploid control ʽKing David’ (G_0_/G_1_ peak at X = 70.46). The triploid control ʽJonagold’ showed a G_0_/G_1_ peak at X = 102.17. Secondary G_2_/M peaks were also observed in all samples: X = 140.41 (ʽKing David’), X = 202.83 (ʽJonagold’), and X = 141.23 (ʽWaltana’), confirming the diploid genome size of the ʽWaltana’ genotype
Genotyping for the red-flesh mutation
Using the method described by Espley et al. (2009) revealed that the promoter of the MdMYB10 gene in all 80 new red-fleshed hybrids contained the six minisatellite repeats (R_6_) in either heterozygous (R_1_R_6_) or homozygous (R_6_R_6_) form (Table 3), confirming that all plants are type I red-fleshed mutants. This is not surprising as the mutant phenotype was evident from the red color of the first true leaves, making it useful for selection by breeders. The intensity of pigmentation in apple fruit flesh was not associated with the heterozygous or homozygous state of the R_6_ allele in the analyzed accessions as the red color intensity of R_1_R_6_ genotypes ranged from class 1 to 5 (Table 3). Table 3S-genotype and MdMYB10 gene variants of red-fleshed hybrids and parental cultivarsVariety/hybridMdMYB10Red color intensityS-genotypeParental cultivarsRed-fleshed parental cultivarsMalus ʽGeneva’R_1_R_6_2S3S44_ʽAlmandin’R_1_R_6_4S_1S3Malus ʽRoberts’R_6_R_6_5S1S20Malus denboerii ʽEvelyn’R_6_R_6_5S1S20Malus × purpurea ʽRoyalty’R_6_R_6_5S17S31_White-fleshed parental cultivarsʽQuerina’R_1_R_1–S3S9_ʽArtemisz’R_1_R_1–S2S7_ʽRosmerta’R_1_R_1–S9S10_E87R_1_R_1–S3S16b_ʽReleika’ × UnknownA376R_1_R_1–**S2/10S28_ʽGloster 69’ (S28S?) × ʽPrima’ (S2S10)A56R_1_R_1–S2S_7_ʽPrima’ (S2S10) × UnknownNew red-fleshed hybridsT94R_1_R_6_4S2S_20_ʽArtemisz’ (S2S7) × Malus denboerii ʽEvelyn’ (S1S20)T55R_1_R_6_3S2S_20_X230R_1_R_6_3S7S_20_X231R_1_R_6_3S7S_20_X232R_1_R_6_4S2S_20_X233R_1_R_6_3S2S_20_X235R_1_R_6_4S2S_20_X236R_1_R_6_4S7S_20_X237R_1_R_6_4S2S_20_X239R_1_R_6_3S1S_2_X240R_1_R_6_4S2S_20_X241R_1_R_6_3S2S_20_X242R_1_R_6_4S2S_20_X243R_1_R_6_3S2S_20_X244R_1_R_6_4S1S_2_T41R_1_R_6_3S1S_7_Supposed as ʽRosmerta’ (S9S10) proved as ʽArtemisz’ (S2S7) × Malus denboerii ʽEvelyn’ (S1S20)T47R_1_R_6_4S2S_20_X258R_1_R_6_4S20S44Malus ʽGeneva’ (S3S44) × Malus ʽRoberts’ (S1S20)X266R_1_R_6_4S20S_44_X147R_1_R_6_4S1S_10_Supposed as ʽQuerina’ (S3S9) proved as unknown (S10S24) × Malus ʽRoberts’ (S1S20)X149R_1_R_6_2S1S_24_X150R_1_R_6_4S1S_10_X151R_1_R_6_4S10S_20_X152R_1_R_6_4S10S_20_X148R_1_R_6_2S9S_20_ʽQuerina’ (S3S9) × Malus ʽRoberts’ (S1S20)X153R_1_R_6_3S9S_20_X154R_1_R_6_3S1S_3_X238R_1_R_6_3S1S3Malus ʽGeneva’ (S3S44) × ʽAlmandin’ (S1S3)X272R_1_R_6_n.dS1S_44_X273R_1_R_6_4S1S_3_X274R_1_R_6_5S1S_44_X275R_1_R_6_1S1S_44_X276R_6_R_6_n.dS1S_3_X277R_1_R_6_n.dS1S_3_X278R_1_R_6_5S1S_44_X279R_6_R_6_5S1S_44_T97R_1_R_6_4S1S_44_T98R_1_R_6_4S1S_44_T40R_1_R_6_1S9S_20_ʽRosmerta’ (S9S10) × Malus denboerii ʽEvelyn’ (S1S20)T45R_1_R_6_3S9S_20_T53R_1_R_6_3S1S_10_T255R_1_R_6_3S1S_10_T264R_1_R_6_4S10S_20_T138R_1_R_6_2S20S_28_A-376 (S2/10S28) × supposed as Malus × purpurea ʽRoyalty’ (S17S31) and proved as Malus ʽRoberts’ (S1S20)T139R_1_R_6_2S20S_28_T141R_1_R_6_2S20S_28_T142R_1_R_6_3S1S_28_T143R_1_R_6_2S1S_28_X286R_1_R_6_4S1S_3_E-87 (S3S16b) × Malus denboerii ʽEvelyn’ (S1S20)X287R_1_R_6_3S16bS_20_X288R_1_R_6_4S1S_16b_X289R_1_R_6_4S1S_16b_X290R_1_R_6_4S1S_16b_X291R_1_R_6_4S3S_20_X292R_1_R_6_3S16bS_20_X293R_1_R_6_4S16bS_20_X294R_1_R_6_4S1S_16b_X295R_1_R_6_3S1S_16b_X296R_1_R_6_4S16bS_20_X297R_1_R_6_4S1S_16b_X298R_1_R_6_2S3S_20_X299R_1_R_6_4S16bS_20_X300R_1_R_6_4S1S_3_X301R_1_R_6_3S1S_3_X302R_1_R_6_2S3S_20_X304R_1_R_6_4S3S_20_X305R_1_R_6_4S16bS_20_T96R_1_R_6_5S1S_3_T164R_1_R_6_3S2S_3_A-56 (S2S7) × ʽAlmandin’ (S1S3)T165R_1_R_6_2S1S_7_X234R_1_R_6_4S1S_28_Unknown × Malus denboerii ʽEvelyn’ (S1S20)X303R_1_R_6_5S1S_8_T60R_1_R_6_2S7S_20_ʽHesztia’ (S7S10) × Malus denboerii ʽEvelyn’ (S1S20)T123AR_1_R_6_2S2S_3_unknownT163R_1_R_6_4S2S_3_A347 × ʽAlmandin’ (S1S3)T184R_1_R_6_2S9S_44_ʽQuerina’ (S3S9) × Malus ʽGeneva’ (S3S44)T249R_1_R_6_2S9S3Malus* ʽGeneva’ (S3S44) × UnknownX253R_1_R_6_3S1S_2_A-376 (S2/10S28) × Malus × purpurea ʽEleyi’X283R_1_R_6_3S1S_3_ʽArtemisz’ (S2S7) × ʽAlmandin’ (S1S3)J561R_1_R_6_2S9S20_unknownNew non-red-fleshed cultivars and hybrids:ʽIsolda’R_1_R_1–S16bS20_ʽVeritas’R_1_R_1–S1S3_ʽBellona’R_1_R_1–S3S8_ʽFreedom’ × ʽFlorina’ (S3S9)G103R_1_R_1–S9S28_ʽSnygold’ (Earligold) × ʽBaujade’I142R_1_R_1–S2S9_I233R_1_R_1–S2S9^*^S-genotypes inferred based on progeny S-genotypes; n.d., no data
Discussion
Methods and verification
Base pair-accurate amplicon sizes from consensus primer assays were determined and confirmed by allele-specific primers and sequencing for the* S*1, S2, S3, S5, S6, S7, S9, S10, S16b, S17, S20, S24, S28, S31, and S44-alleles (Table 1). The results obtained with the directly labeled forward primer were consistent and accurate to the base pair across numerous alleles and PCR reactions using DreamTaq™, Geneaid and Canvax HotBegan™ polymerases and master mixes, various reagents and DNA extracted with different kits.
This improvement was motivated by two key limitations identified in the assay developed by Larsen et al. (2016), which may have hindered the broader adoption. First, the forward primers incorporated an M13 tail that enabled indirect fluorescent labeling using a universal M13 primer (Schuelke 2000), an approach that can introduce systematic shifts in fragment size. Therefore, allele sizes reported in that study are not directly comparable with data obtained using directly labeled primers, even when identical consensus primers are used.
Fragment size differences between results using the M13 universal tail (Larsen et al. 2016) and the directly labeled forward primer in this study ranged from 15–18 bp (S1) to 25–27 bp (S3). The data indicates that size alterations were not solely attributable to the length of the M13 tail (19 bp). Larsen et al. (2016) reported fragments that were most frequently larger, and occasionally smaller, than the expected size by 1–3 bp. Size alterations between the directly labeled primers used in our study and the fragments amplified by M13 tail primer (Larsen et al. 2016) ranged from –3 to + 9 bp. Similarly, Mair et al. (2017) also observed larger deviations of 5–9 bp for several alleles. Regardless of the cause, deviations of this magnitude (≤ 12 bp) may introduce uncertainty in allele identification and hinder reliable comparisons across studies. Of the 36 alleles with known fragment sizes (Larson et al., 2016), 48 pairs of alleles may not be resolved if the accuracy is limited to 12 bp as reported for M13 tail primers.
In addition, the consensus primers themselves can generate fragments of identical or nearly identical size for certain allele pairs. Larsen et al. (2016) characterized those fragments by restriction digestion, an approach also employed by Alessandri et al. (2024) to differentiate amplicons generated with the PyCom C1F1 and PyCom C5R1 consensus primers. To avoid the high costs and labor associated with restriction enzymes, a PCR-only approach was developed to facilitate and streamline the workflow.
In breeding programs, unless there are multiple parental S-alleles present that generate fragments of identical length, no extra work is required as the PCR results in our approach are base pair accurate and thus unambiguous. If the specific alleles are present, the S16-R and S25-R reverse primers can be multiplexed with the S3/S5/S10-R reverse primers, as proposed by Larsen et al. (2016). Allele-specific primers are then only required to distinguish between fragments of identical size or to identify missing alleles. This approach significantly reduces the number of PCR reactions and associated costs. In population surveys where parentage is unknown, the consensus primers remain accurate at the base-pair level; however, care should be taken to confirm whether previously unidentified alleles with matching or similar amplicon sizes are present.
Protocol minimal
Based on combined experimental and in silico analysis, we propose the following modifications to the protocol of Larsen et al. (2016) to resolve ambiguities and improve the reliability of allele discrimination (S1–S47), while minimizing the number of PCR reactions and eliminating the need for restriction enzyme digestion. For distinguishing between the S9 and S46 (1 bp difference) or the S40 and S46-alleles (2 bp difference), the MdS_46_SpF and MdS_46_SpR primer pairs (Long et al. 2010) can be used to detect the presence or absence of the S46-allele. In case of the S9 and S40-alleles (1 bp difference), the S9 allele-specific primers FTC154 and FTC155 (Broothaerts 2003) will not amplify the S40-allele. In the case of the S2 and S_23_-alleles (identical sizes), the OWB122 and OWB123 S2 allele-specific primers (Broothaerts 2003) will not amplify the S23-allele. To differentiate between the S11 and S21 (2 bp difference), the S21 and S44 (identical sizes), and the S11 and S44-alleles (2 bp difference), the MdS_21_SpF-MdS_21_SpR and MdS_44_SpF-MdS_44_SpR primer pairs (Long et al. 2010) can be used to detect the presence or absence of the S21 or S44 alleles. In case of the S20 and S45-alleles (1 bp difference), FTC141 and FTC142 primers (Broothaerts 2003) can be used as these will not amplify the S45-allele.
The S6/S17 and S28-alleles (2 bp difference) can be easily discerned using any of the available allele-specific primers. The fragments amplified from the S6 and S17-alleles using the consensus primers are identical in size (369 bp) but differ by five base substitutions (De Franceschi et al. 2018). The primer binding sites for the FTC141 and FTC142 primers (Broothaerts 2003), based on the full sequences (MG598492 and MG598499), are identical in both alleles making this primer pair incapable of discriminating between these two alleles. The fragments amplified with the consensus primers can also be cloned and sequenced for critical applications or the S17 amplicons can be identified based on a unique restriction site for MluI at 255 bp (providing fragments of 255 and 114 bp). However, the functional distinction between S6 and S17 has not yet been demonstrated (De Franceschi et al. 2018), and an increased S-genotype resolution is only warranted if the two alleles are functionally distinct.
Any other S-allele combinations published by Larsen et al. (2016) differ by at least 3 bp in length and discerning between these should be straightforward using the electropherograms of directly labeled base pair accurate amplicons even without prior knowledge of possible S-alleles present in the samples.
Genotyping the new hybrids
In order to S-genotype the new red-fleshed hybrids, the 11 parental cultivars were to be genotyped first. However, plant material was unavailable for the parents E87, A376, and A56. E87 is an open pollinated seedling of ʽReleika’, itself with an undetermined S-genotype, A376 is a cross between ʽGloster 69’ (S28S?, Morita et al. 2009) and ʽPrima’ (S2S10, Tóth et al. 2012) and A56 is an open pollinated seedling of ʽPrima’. Based on these data, the expected S-allele pool of the new red-fleshed hybrids was determined as S1–S3, S7, S9, S10, S17, S20, S31, and S44. During the analysis, the S8, S16, and S28-alleles were also detected. These results and the pedigree records helped assign the S3S16b and S2S7 genotypes to E87 and A56, respectively. The results for A376 were inconclusive as all its progenies inherited the S28-allele (transferred from ʽGloster 69’), making the S-genotype of A376 either S2S28 or S10S28.
Results of the new red-fleshed hybrids also pointed to procedural errors during pollination and/or seed handling in some cases. T41 and T47 were supposed to be crosses between ʽRosmerta’ (S9S10) and Malus denboerii ʽEvelyn’ (S1S20). However, their S-genotypes are S1S7 for T41 and S2S20 for T47. ʽArtemisz’ (S2S7) was determined as a seed parent in both cases. T138–T143 were all recorded as crosses between A376 (S2/10S28) and Malus purpurea ʽRoyalty’ (S17S31); however, none of these hybrids carry the S17 or S31 alleles; the pollinating parent was determined to be Malus ʽRoberts’ (S1S20) based on S-alleles present in all five hybrids. The X147–154 hybrids were supposed to be crosses between ʽQuerina’ (S3S9) and Malus ʽRoberts’ (S1S20); however, neither of the hybrids carry the S3- or S9-alleles. Instead, the second allele of X149 is S24 and for the others, it is S10, and the source of those alleles could not be identified.
S-genotyping of American heritage apples
In the American heritage apples, two alleles were amplified in seven varieties each supporting their diploid status; however, in ʽPink Parfait’ and ʽWaltana’, the consensus primers amplified three alleles. The presence of all detected alleles in the heritage apples was confirmed with allele-specific primers (Broothaerts 2003; Long et al. 2010) and the S1, S10, and S17-alleles amplified from ʽWaltana’ were also cloned and sequenced. The S1 and S10 amplicons were 99.8% to 100% identical to the GenBank sequences; the S17 nucleotide sequence was 97.3% identical to the GenBank sequence with one asparagine to lysine conservative replacement resulting in a 98.5% match in the encoded protein sequence. Ploidy levels were also confirmed for all heritage apples using flow cytometry, and all were determined to be diploid. Pinpointing the exact phenomenon behind the results requires further, more focused research. A segmental duplication on chromosome 17 involving one S-allele may explain these results, similar to mutations reported in Pyrus (Mase et al. 2014; Nishio et al. 2024). The occurrence of this putative mutation in two genotypes can be attributed to the shared origin of the ʽWaltana’ and ʽPink Parfait’ varieties (Gruenfeld and Hill 2025).
S-allele frequencies
The high frequency (23%) of S1- and S20-alleles (Fig. 3) aligns with the extensive use of Malus ʽRoberts’ and Malus denboerii ʽEvelyn’ as R6 donors. In addition, each of these alleles appeared in other parental cultivars (S1 in ʽAlmandin’ and S20 in ʽIsolda’). Although the S-allele composition of the new hybrids is predetermined due to the small number of parental cultivars used, the S2- and S3-alleles are still present in 11–12% of the new hybrids. This ratio is much lower than in other populations; the worldwide overrepresentation of these two alleles was previously linked to the overuse of disease-resistant accessions as parents (Nybom et al. 2008; Alessandri et al. 2024). Of the 11 parental cultivars, six are resistant to apple scab, powdery mildew and/or fire blight; three of these carry the S2- and another three the S3-allele, yet both can be considered underrepresented compared to other populations (Broothaerts et al. 2004; Nybom et al. 2008; Halász et al. 2011; Larsen et al. 2016; De Franceschi et al. 2018; Alessandri et al. 2024). Compared to traditional Hungarian varieties (Halász et al. 2011), the S7- and S24-alleles are significantly less common and the S5 is completely absent in the new hybrids. Although the S24-allele was not present in the parents used for crossbreeding, it is relatively common in Hungarian germplasm, and several cultivars carrying it were present in the orchard where pollination work was done, suggesting these as the likely source of the allele in the X149 hybrid.Fig. 3S-allele frequency distribution among the new red-fleshed hybrids S-genotyped in this work (blue), among the most popular commercial cultivars worldwide (red, Broothaerts, 2004) and among the old Hungarian cultivars and landraces (green, Halász et al. 2011). Results under S16b include S22, S23 and S25 from Broothaerts (2004) work based on these alleles coding for the same protein as S16b (Matsumoto and Furusawa, 2005)
The characteristic differences in allele frequencies suggest that the hybrids could serve as promising accessions for breeding programs aimed at increasing genetic diversity in pathogen-resistant cultivars.
Conclusions
An optimized high-throughput S-genotyping protocol for apple is reported here, providing base pair-accurate allele identification without reliance on restriction enzyme digestion. Our protocol facilitates direct comparison of data across studies. We applied this method to 80 novel red-fleshed and six white-fleshed apple hybrids, as well as their parental cultivars, revealing 16 distinct S-alleles and resolving several methodological inconsistencies reported in previous studies. All red-fleshed hybrids were confirmed to carry the R_6_ allele of the MdMYB10 gene, validating their type I red-fleshed phenotype. The S-genotyping results also uncovered discrepancies in pedigree records and pollination procedures, highlighting the utility of molecular markers in breeding program quality control. Ploidy analysis of heritage apples confirmed diploidy despite the presence of three S-alleles, suggesting possible segmental duplications requiring further investigation. The observed S allele frequencies differed from those in traditional Hungarian and international germplasm, indicating that these hybrids may contribute to enhancing genetic diversity in disease-resistant apple breeding. Overall, the standardized, high-resolution and efficient genotyping approach and the characterized hybrids provide valuable tools and resources for future apple breeding and germplasm management.
Supplementary Information
Below is the link to the electronic supplementary material.Supplementary file1 (DOCX 20 KB)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1FAOSTAT (2023) Agriculture Data Available from: http://www.fao.org/faostat/en/#data/QC. Accessed 22 May 2025
- 2Gruenfeld, G, Hill, P (2025) Pomiferous https://pomiferous.com
