# Transcriptomic and Metabolic Insight Into Flavonoid Biosynthesis Underlying Black and Yellow Seed Coat Color Variation in Soybean (Glycine max)

**Authors:** Kahee Moon, Prakash Basnet, Seung Young Choi, Beom‐Soon Choi, Grace Lachica, Nam‐Il Park, Kyong‐Cheul Park, Taeyoung Um, Ik‐Young Choi

PMC · DOI: 10.1002/pld3.70153 · Plant Direct · 2026-02-20

## TL;DR

This study combines RNA sequencing and metabolite measurements to examine how flavonoid pathway genes and pigment metabolites together produce black versus yellow seed coat color in soybean.

## Contribution

The study identifies specific genes and metabolic pathways linked to soybean seed coat color variation through integrated transcriptomic and metabolic analysis.

## Key findings

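- Black seed coats have significantly higher total phenolic, flavonoid, anthocyanin, and proanthocyanidin content and antioxidant activity than yellow seed coats, which instead have higher total isoflavone content.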
- F3H, ANS, UGT78D2, UGT79B6, and GSTF11 are upregulated in black seed coats; they encode enzymes reported to function in anthocyanin biosynthesis, glycosylation, and vacuolar transport, respectively.
- LAR genes are more highly expressed in yellow seed coats despite lower proanthocyanidin levels, suggesting complex regulation, whereas LAC5 expression is specific to black seed coats.

## Abstract

Soybean (Glycine max) seed coat color variation is determined by the accumulation of flavonoid‐derived pigments, although the molecular mechanisms underlying this trait remain poorly understood. This study integrated RNA sequencing (RNA‐Seq) and high‐performance liquid chromatography (HPLC)–based metabolite measurements to investigate black and yellow seed coat soybean lines derived from the same genetic background. Metabolite analysis revealed significantly higher total phenolic content (TPC), total flavonoid content (TFC), total anthocyanin content (TAC), total proanthocyanidin content (TPAC), and antioxidant activity (DPPH, ABTS) in black seed coats, whereas yellow seed coats exhibited elevated total isoflavone content (TIC). RNA‐Seq at 110 days after sowing (DAS) identified differential expression of flavonoid pathway genes associated with these metabolic differences. Genes upregulated in black seed coats included flavanone 3‐hydroxylase (F3H), anthocyanidin synthase (ANS), UDP‐glycosyltransferases (UGT78D2, UGT79B6), and glutathione S‐transferase (GSTF11), encoding enzymes reported to function in anthocyanin biosynthesis, glycosylation, and vacuolar transport, respectively. Conversely, leucoanthocyanidin reductase (LAR) genes showed higher expression in yellow seed coats despite lower proanthocyanidin (PA) levels, whereas LAC5 exhibited black seed–specific expression consistent with potential PA polymerization activity. R2R3‐MYB transcription factor genes along with small heat shock protein genes (sHSPs) were also upregulated in black seed coats, suggesting candidate regulatory roles in pigmentation and stress responses. Cytochrome P450 genes showed preferential expression in yellow seed coats, consistent with isoflavonoid pathway activation. Together, these findings elucidate the genetic and metabolic regulation of seed coat color in soybean and identify candidate genes relevant for functional breeding and genomics research.

## Linked entities

- **Genes:** F3H (flavanone 3-hydroxylase) [NCBI Gene 732548], ANS (putative 2-oxoglutarate-dependent dioxygenase) [NCBI Gene 41981027], UGT78D2 (UDP-glucosyl transferase 78D2) [NCBI Gene 831568], AT5G54010 (UDP-Glycosyltransferase superfamily protein) [NCBI Gene 835484], GSTF11 (glutathione S-transferase F11) [NCBI Gene 821227], LAR (leucoanthocyanidin reductase) [NCBI Gene 100786879], LAC5 (laccase 5) [NCBI Gene 818630], LOC101250648 (R2R3-MYB transcription factor SlAN2-like) [NCBI Gene 101250648], CYP71B9 (cytochrome P450, family 71, subfamily B, polypeptide 9) [NCBI Gene 814788]
- **Chemicals:** flavonoid (PubChem CID 10251), anthocyanin (PubChem CID 145858), proanthocyanidin (PubChem CID 108065), isoflavone (PubChem CID 72304)
- **Species:** Glycine max (taxon 3847)

## Full-text entities

- **Genes:** C4H (trans-cinnamate 4-monooxygenase) [NCBI Gene 100499623] {aka CA4H, CYP73, CYP73A11, c4h2}, W2 (R2R3 MYB transcription factor W2) [NCBI Gene 100782308], LOC100780605 (cytochrome P450 71D8) [NCBI Gene 100780605] {aka CYP71D8}, LOC100778056 (cytochrome P450 82A4) [NCBI Gene 100778056] {aka CYP82A4}, Glutathione S-transferase [NCBI Gene 547580], MYB111 [NCBI Gene 778065], GSTF11 (glutathione S-transferase F11) [NCBI Gene 821227] {aka ARABIDOPSIS GLUTATHIONE-S-TRANSFERASE 6, ATGSTF11, ATGSTF6, GLUTATHIONE S-TRANSFERASE, T17B22.12, T17B22_12}, TT10 (Laccase/Diphenol oxidase family protein) [NCBI Gene 834862] {aka ATLAC15, AtTT10, LAC15, LACCASE-LIKE 15, MDN11.18, MDN11_18}, ANS2 (anthocyanidin synthase) [NCBI Gene 547615] {aka ANS, ANS3}, Chalcone Synthase [NCBI Gene 100170701], GSTF11 [NCBI Gene 100812502], LAR [NCBI Gene 100786879], F3H (flavanone 3-hydroxylase) [NCBI Gene 732548] {aka F3H2}, CYP82A2 (cytochrome P450 82A2-like) [NCBI Gene 100798546]
- **Diseases:** obesity (MESH:D009765), cardiovascular diseases (MESH:D002318), cancer (MESH:D009369), inflammation (MESH:D007249), arthritis (MESH:D001168)
- **Chemicals:** potassium chloride (MESH:D011189), sodium acetate (MESH:D019346), Anthocyanin (MESH:D000872), HCl (MESH:D006851), Ascorbic acid (MESH:D001205), Flavonoid (MESH:D005419), ethanol (MESH:D000431), saponins (MESH:D012503), pentose (MESH:D010429), Phenol (MESH:D019800), water (MESH:D014867), aluminum nitrate (MESH:C050609), lignin (MESH:D008031), daidzin (MESH:C013908), lipid (MESH:D008055), cysteine (MESH:D003545), ABTS (MESH:C002502), Na2CO3 (MESH:C005686), purine (MESH:C030985), genistein (MESH:D019833), glycitin (MESH:C433283), quercetin (MESH:D011794), nitrogen (MESH:D009584), Isoflavone (MESH:D007529), genistin (MESH:C040641), thiamine (MESH:D013831), phenylalanine (MESH:D010649), tannins (MESH:D013634), acetonitrile (MESH:C032159), 2,2-Diphenyl-1-Picrylhydrazyl (MESH:C004931), daidzein (MESH:C004742), glycitein (MESH:C086566), carbohydrate (MESH:D002241), oil (MESH:D009821), potassium persulfate (MESH:C009007), pyruvate (MESH:D019289), methionine (MESH:D008715), potassium acetate (MESH:D019347), PAs (MESH:D011478), methanol (MESH:D000432), formic acid (MESH:C030544), gallic acid (MESH:D005707), PA (MESH:C013221), glucuronate (MESH:D020723)
- **Species:** Arachis hypogaea (goober, species) [taxon 3818], Vigna radiata (mung bean, species) [taxon 157791], Nicotiana tabacum (American tobacco, species) [taxon 4097], Medicago truncatula (barrel medic, species) [taxon 3880], Glycine soja (wild soybean, species) [taxon 3848], Dimocarpus longan (longan, species) [taxon 128017], Solanum melongena (aubergine, species) [taxon 4111], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Glycine max (soybean, species) [taxon 3847]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12921420/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12921420/full.md

## References

76 references — full list in the complete paper: https://tomesphere.com/paper/PMC12921420/full.md

---
Source: https://tomesphere.com/paper/PMC12921420