# Genomes to fields 2024 maize genotype by environment prediction competition

**Authors:** Qiuyue Chen, Jacob D. Washburn, Dayane Cristina Lima, Maria Cinta Romay, Joseph L. Gage, James B. Holland, Alencar Xavier, Seth C. Murray, David Ertl, Marco Lopez-Cruz, Gustavo de los Campos, Fernando M. Aguate, Timothy M. Beissinger, Martin O. Bohn, Edward S. Buckler, Jode Edwards, Sherry A. Flint-Garcia, Michael A. Gore, Candice N. Hirsch, Shawn M. Kaeppler, Aida Z. Kebede, Joseph E. Knoll, John K. McKay, Richard Minyo, Osler A. Ortez, Jonathan W. Reneau, James C. Schnable, Rajandeep S. Sekhon, Maninder P. Singh, Erin E. Sparks, Addie M. Thompson, Mitchell R. Tuinstra, Jason Wallace, Wenwei Xu, Natalia de Leon

PMC · DOI: 10.1186/s13104-026-07629-5 · 2026-02-09

## TL;DR

This paper introduces a competition to predict maize grain yield using historical and environmental data from 2014 to 2024.

## Contribution

The novel contribution is the release of a curated dataset combining genotype, environment, and yield data for maize prediction modeling.

## Key findings

- The dataset includes phenotypic, genotypic, soil, weather, and environmental data from 2014 to 2023.
- The dataset was curated with quality control and consistent naming conventions for usability.
- The resource supports prediction modeling for maize genotype by environment interactions.

## Abstract

The genomes to fields (G2F) 2024 Maize Genotype by Environment (GxE) Prediction Competition challenged participants to develop and submit their best performing models to predict grain yield for the 2024 maize GxE project field trials, using G2F data collected from 2014 to 2023 and other publicly available data.

The G2F Maize GxE Project is a collaborative effort, with all generated data made publicly available. The resource presented here includes the training and test datasets used for the G2F 2024 Maize GxE Prediction Competition. Specifically, data collected from 2014 to 2023 served as the training set to predict grain yield in the 2024 test set. The dataset comprises phenotypic, genotypic, soil, weather, and environmental covariate data, along with metadata describing environments (year-location combinations). It has been curated and lightly filtered for quality control and to ensure consistent naming across years. Competitors also had access to readme files that describe the structure and content of the datasets.

## Linked entities

- **Species:** Zea mays (taxon 4577)

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

---
Source: https://tomesphere.com/paper/PMC12983533