# Genome-Wide Identification of Candidate Loci Associated with the Regulation of the Protein, Oil, and Carbohydrate Content in Soybean

**Authors:** Sreeparna Chowdhury, Byeong Hee Kang, Won-Ho Lee, Seo-Young Shin, Da-Yeon Kim, Woon Ji Kim, Jeong-Ho Baek, Seong-Hoon Kim, Bo-Keun Ha

PMC · DOI: 10.3390/plants15060924 · Plants · 2026-03-17

## TL;DR

This study identifies genetic loci in soybean that influence the protein, oil, and carbohydrate content of seeds, offering targets for improving soybean nutritional quality.

## Contribution

The study discovers stable QTLs and candidate genes for seed composition traits in soybean using GWAS and haplotype analysis.

## Key findings

- Nine significant loci across multiple chromosomes regulate seed protein, oil, and carbohydrate content.
- Three pleiotropic loci influence multiple compositional traits, indicating shared genetic control.
- Five candidate genes are linked to carbon allocation and metabolic pathways affecting seed composition.

## Abstract

Soybean (Glycine max (L.) Merr.) is a globally important legume valued for the high protein, oil, and carbohydrate content of its seeds. However, it is difficult to simultaneously optimize the content of these three macronutrients due to their negatively correlated metabolic pathways and complex quantitative inheritance patterns. In this study, a genome-wide association study (GWAS) was conducted to elucidate the genetic architecture underlying the seed protein, oil, and carbohydrate content in 328 soybean accessions evaluated over two years using near-infrared spectroscopy. Significant negative correlations (r = −0.323 to −0.656, p < 0.001) were observed between the three traits, confirming the trade-off in carbon partitioning during seed development. The GWAS identified nine significant loci distributed across chromosomes 4, 6, 8, 9, 10, 16, and 18 as stable quantitative trait loci (QTLs) regulating the three traits. Of these, three pleiotropic loci (qProOil.4, qProOil.16, and qOilCarb.6) were found to be associated with multiple seed compositional traits. Haplotype analysis revealed seven haplotype blocks with distinct phenotypic variation, indicating that they have the potential for use as trait-specific markers in marker-assisted selection. Functional annotation of the stable QTL regions identified 22 putative genes, among which five candidate genes, Glyma.06g201700, Glyma.08g281900, Glyma.09g164900, Glyma.13g155600, and Glyma.16g209800 were likely to be involved in carbon allocation, protein biosynthesis, lipid metabolism, and carbohydrate modification pathways based on their relative expression levels. Overall, this study enhances the understanding of the genetic and molecular mechanism controlling the composition of soybean seed and identifies promising genomic targets for precision breeding programs aimed at improving nutritional quality.

## Full-text entities

- **Chemicals:** Carbohydrate (MESH:D002241), Oil (MESH:D009821), lipid (MESH:D008055), carbon (MESH:D002244)
- **Species:** Glycine max (soybean, species) [taxon 3847]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13030290/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13030290/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/PMC13030290/full.md

---
Source: https://tomesphere.com/paper/PMC13030290