# Towards a sweetpotato genomic-enabled breeding: optimizing two-stage analysis of multi-environment augmented trials

**Authors:** Saulo Chaves, Reuben Ssali, José Tiago B. Chagas, Kaio Olimpio G. Dias, Bert De Boeck, Thiago Mendes, Hannele Lindqvist-Kreuze, Hugo Campos, G. Craig Yencho, Guilherme da Silva Pereira

PMC · DOI: 10.1007/s00122-026-05204-x · TAG. Theoretical and Applied Genetics. Theoretische Und Angewandte Genetik · 2026-03-21

## TL;DR

This study explores efficient methods for genomic selection in sweetpotato breeding using two-stage models, aiming to optimize predictions when trials are unreplicated.

## Contribution

The study introduces optimized two-stage models using deregressed pedigree-based predictions for genomic selection in unreplicated sweetpotato trials.

## Key findings

- Using deregressed pedigree-based predictions in two-stage models slightly improves selection accuracy compared to other methods.
- For genomic predictions, the weighting scheme had a greater impact on performance than the choice of prediction input.
- Pool-specific models using deregressed pedigree-based predictions outperformed general models in prediction accuracy.

## Abstract

Using the full weight matrix and deregressed pedigree-based best linear unbiased predictions in second-stage models lead to selections and genomic predictions closer to those obtained using a single-stage model.

In multi-environment genomic selection, although single-stage (SS) models are generally more efficient (no loss of information), there are contexts where they are difficult to fit, making two-stage models the most practical alternative. An example is the evaluation of early-stage observational trials (OTs) of sweetpotato breeding, where several clones are tested in unreplicated trials. In this study, 1,138 clones derived from partial diallels within two gene pools had their storage root yield evaluated across six OTs. Using this scenario, we compared the selection and prediction performances of models under different two-stage strategies against the SS benchmark. We also tested whether pool-specific genomic prediction models offered advantages over models trained with the complete dataset. Given the lack of replication in OTs, we hypothesized that deregressed best linear unbiased predictions (dBLUPs) or pedigree-based dBLUPs (dABLUPs) would work more appropriately as inputs for second-stage models than best linear unbiased estimates (BLUEs). These comparisons were conducted within weighted models using either a diagonal weight matrix or the full weight matrix. For selection, differences among second-stage models were minor, with a slight advantage for those using dABLUPs as entries, combined with the full weight matrix. For prediction, however, the choice of weighting scheme had a greater impact on performance than the choice of entry. Using the complete dataset, differences between entries were marginal, but for pool-specific predictions, dABLUPs provided the best performance. Overall, if adopting a two-stage strategy for the analysis of augmented trials, we recommend using dABLUPs together with the full weight matrix.

The online version contains supplementary material available at 10.1007/s00122-026-05204-x.

## Linked entities

- **Species:** Ipomoea batatas (taxon 4120)

## Full-text entities

- **Diseases:** FA (MESH:C565561), SPVD (MESH:D014777), Alternaria blight disease (MESH:D004194), SS (MESH:D012640), CIP (MESH:C565467)
- **Species:** Glycine max (soybean, species) [taxon 3847], Solanum tuberosum (potatoes, species) [taxon 4113], Ipomoea batatas (batate, species) [taxon 4120]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13005861/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13005861/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/PMC13005861/full.md

---
Source: https://tomesphere.com/paper/PMC13005861