# Multivariate colorimetric phenotyping reveals genetic loci associated with soybean seed coat pigmentation and epicatechin accumulation

**Authors:** Eunsoo Lee, Sewon Park, Yeon Ju An, Jungmin Ha

PMC · DOI: 10.1007/s11032-026-01655-8 · Molecular Breeding : New Strategies in Plant Improvement · 2026-03-26

## TL;DR

This study uses CIELAB color measurements (L*, a*, b*) to identify genetic loci linked to soybean seed coat color and to accumulation of epicatechin, a health-promoting flavonoid.

## Contribution

Introduces multivariate colorimetric analysis to link soybean pigmentation traits with epicatechin accumulation and identifies associated genetic loci.

## Key findings

- High epicatechin content correlates with specific colorimetric traits (lower L* and b*, higher a* values).
- Thirteen QTLs on chromosomes 01, 05, 06, 08, and 19 are associated with L*, a*, b*, and principal component scores; a major locus on chromosome 08 lies near the classical I locus.
- Candidate genes in these regions include flavonoid 3′-hydroxylase and the transcription factors MYB117, MYB60, and TCP5.

## Abstract

Seed coat pigmentation in soybean is controlled by complex genetic mechanisms involving structural and regulatory genes in the flavonoid biosynthetic pathway. Although brown seed coats are often associated with epicatechin (EC) accumulation, visual classification alone cannot reliably predict EC content. To quantitatively characterize seed coat coloration and its relationship with EC accumulation, we evaluated multivariate colorimetric traits (L*, a*, and b* values in the CIELAB color space) in 235 recombinant inbred lines (RILs) derived from Jinpung (yellow seed coat) and IT109098 (greenish-brown seed coat). Principal component analysis (PCA) of L*, a*, and b* values revealed that genotypes with detectable EC were confined to specific regions of the multivariate color space, indicating that EC accumulation is associated with coordinated color balance rather than overall pigmentation intensity. RILs with high EC content showed significantly lower L* (32.76 ± 2.49) and b* (13.18 ± 2.66) values and higher a* values (5.47 ± 1.31) than those with low EC content. Quantitative trait loci (QTL) mapping identified thirteen loci associated with L*, a*, b*, and principal component scores across chromosomes 01, 05, 06, 08, and 19. A major locus on chromosome 08 near the classical I locus explained a large proportion of phenotypic variance in pigmentation traits. In addition, loci on chromosomes 06 and 19 were associated with integrated color components, suggesting quantitative modulation of EC accumulation. Candidate genes within these regions included flavonoid 3′-hydroxylase and transcription factors such as MYB117, MYB60, and TCP5, supported by sequence variation and differential expression analyses. These findings demonstrate that multivariate colorimetric traits provide a useful phenotyping framework for dissecting seed coat pigmentation and EC accumulation and for pre-selecting high-EC soybean lines.

The online version contains supplementary material available at 10.1007/s11032-026-01655-8.
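
The PCA step described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `pca_scores`, the choice to standardize each trait before decomposition, and the randomly generated L*, a*, b* values are all assumptions made for illustration. Only the sample size of 235 lines and the three CIELAB traits come from the abstract.

```python
import numpy as np

def pca_scores(lab):
    """Hypothetical helper: PCA on a (n_lines, 3) array of L*, a*, b* values.

    Assumes each trait is standardized (z-scored) first; the paper's
    summary does not say whether the authors scaled the traits.
    """
    lab = np.asarray(lab, dtype=float)
    z = (lab - lab.mean(axis=0)) / lab.std(axis=0, ddof=1)
    # SVD of the standardized matrix: rows of vt are the component loadings.
    u, s, vt = np.linalg.svd(z, full_matrices=False)
    scores = u * s                       # per-line principal component scores
    explained = s**2 / np.sum(s**2)      # fraction of variance per component
    return scores, vt, explained

# Synthetic placeholder data (NOT measurements from the study).
rng = np.random.default_rng(0)
n_lines = 235
lab = np.column_stack([
    rng.normal(45.0, 8.0, n_lines),   # L* (lightness), arbitrary values
    rng.normal(4.0, 2.0, n_lines),    # a* (green-red axis), arbitrary values
    rng.normal(18.0, 5.0, n_lines),   # b* (blue-yellow axis), arbitrary values
])

scores, loadings, explained = pca_scores(lab)
print("explained variance ratio:", np.round(explained, 3))
```

In a workflow like the one the abstract describes, the columns of `scores` would be treated as additional quantitative traits alongside L*, a*, and b* for QTL mapping, and lines with detectable EC could be overlaid on a PC1 versus PC2 plot to see whether they cluster.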

## Linked entities

- **Genes:** MYB117 (myb domain protein 117) [NCBI Gene 839219], MYB60 (MYB transcription factor MYB60) [NCBI Gene 778080], Tcp5 (t-complex protein 5) [NCBI Gene 109768], TT7 (Cytochrome P450 superfamily protein) [NCBI Gene 830693]
- **Chemicals:** epicatechin (PubChem CID 1203)

## Full-text entities

- **Genes:** MYB117 [NCBI Gene 778087], MYB60 (MYB transcription factor MYB60) [NCBI Gene 778080], flavonoid 3'-hydroxylase [NCBI Gene 547475]
- **Chemicals:** EC (MESH:D002392), flavonoid (MESH:D005419)
- **Species:** Glycine max (soybean, species) [taxon 3847]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13022137/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13022137/full.md

## References

1 reference; full list in the complete paper: https://tomesphere.com/paper/PMC13022137/full.md

---
Source: https://tomesphere.com/paper/PMC13022137