# Transcripts and genomic intervals associated with variation in metabolite abundance in maize leaves under field conditions

**Authors:** Ramesh Kanna Mathivanan, Connor Pedersen, Jonathan Turkus, Nikee Shrestha, Waqar Ali, J. Vladimir Torres-Rodriguez, Ravi V. Mural, Toshihiro Obata, James C. Schnable

PMC · DOI: 10.1186/s12864-025-11580-3 · BMC Genomics · 2025-05-01

## TL;DR

This study explores how genetic and transcriptomic factors influence metabolite levels in maize under field conditions, identifying genomic regions and transcripts linked to metabolic variation.

## Contribution

The study is the first to use field-grown maize to link metabolite variation with genomic and transcriptomic data using GWAS, TWAS, and AI-based methods.

## Key findings

- Genome-wide association and random forest methods identified significant associations between genes and metabolite abundance.
- Transcriptome-wide association studies found only six significant transcript-metabolite associations.
- Genetic markers for metabolite variation sometimes overlapped with markers for non-metabolic traits.

## Abstract

Plants exhibit extensive environment-dependent intraspecific metabolic variation, which likely plays a role in determining variation in whole plant phenotypes. However, much of the work seeking to use natural variation to link genes and transcript’s impacts on plant metabolism has employed data from controlled environments. Here, we generated and analyzed data on the variation in the abundance of 26 metabolites across 660 maize inbred lines under field conditions. We employ these data and previously published transcript and whole plant phenotype data reported for the same field experiment to identify both genomic intervals (through genome-wide association studies (GWAS)) and transcripts (using both transcriptome-wide association studies (TWAS) and an explainable artificial intelligence (AI) approach based on random forest (RF)) associated with variation in metabolite abundance. Both genome-wide association and random forest-based methods identified substantial numbers of significant associations including genes with plausible links to the metabolites they are associated with. In contrast, the transcriptome-wide association identified only six significant associations. In three cases, genetic markers associated with metabolic variation in our study colocalized with markers linked to variation in non-metabolic traits scored in the same experiment. We speculate that the poor performance of transcriptome-wide association studies in identifying transcript-metabolite associations may reflect a high prevalence of non-linear interactions between transcripts and metabolites and/or a bias towards rare transcripts playing a large role in determining intraspecific metabolic variation.

The online version contains supplementary material available at 10.1186/s12864-025-11580-3.

## Linked entities

- **Species:** Zea mays (taxon 4577)

## Full-text entities

- **Diseases:** ID (MESH:C537985), AI (MESH:C538142)
- **Chemicals:** aspartic acid (MESH:D001224), chloroform (MESH:D002725), lignin (MESH:D008031), loganin (MESH:C059516), mucic acid (MESH:C000090), arginine (MESH:D001120), fructose (MESH:D005632), nitrogen (MESH:D009584), citric acid (MESH:D019343), N-acetylglutamate 5-phosphate (-), quinic acid (MESH:D011801), vitamin E (MESH:D014810), phosphoric acid (MESH:C030242), chlorophyll (MESH:D002734), Beta-alanine (MESH:D015091), malic acid (MESH:C030298), chlorogenic acid (MESH:D002726), glycerol 1-phosphate (MESH:C029620), raffinose (MESH:D011887), Glutamic acid (MESH:D018698), glyceric acid (MESH:C042971), water (MESH:D014867), argon (MESH:D001128), Threonine (MESH:D013912), sugars (MESH:D000073893), D-glucose (MESH:D005947), Shikimic acid (MESH:D012765), N-Methyl-N-(trimethylsilyl) trifluoroacetamide (MESH:C086665), tyrosine (MESH:D014443), galactonic acid (MESH:C012991), carboxylic acids (MESH:D002264), myo-inositol (MESH:D007294), methanol (MESH:D000432), glucose- 6-phosphate (MESH:D019298), sucrose (MESH:D013395), L-serine (MESH:D012694), pyridine (MESH:C023666), methoxyamine hydrochloride (MESH:C005214), ribitol (MESH:D012255), caffeic acid (MESH:C040048), -alanine (MESH:D000409), aldehydes (MESH:D000447)
- **Species:** Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Solanum tuberosum (potatoes, species) [taxon 4113], Oryza sativa (Asian cultivated rice, species) [taxon 4530], Brassica napus (oilseed rape, species) [taxon 3708], Solanum lycopersicum (tomato, species) [taxon 4081]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12046723/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12046723/full.md

## References

6 references — full list in the complete paper: https://tomesphere.com/paper/PMC12046723/full.md

---
Source: https://tomesphere.com/paper/PMC12046723