# Predicting Multiple Traits of Rice and Cotton Across Varieties and Regions Using Multi-Source Data and a Meta-Hybrid Regression Ensemble

**Authors:** Yu Qin, Moughal Tauqir, Xiang Yu, Xin Zheng, Xin Jiang, Nuo Xu, Jiahua Zhang

PMC · DOI: 10.3390/s26020375 · Sensors (Basel, Switzerland) · 2026-01-06

## TL;DR

This paper introduces a new method to predict multiple crop traits for rice and cotton using combined data and machine learning, improving accuracy over existing models.

## Contribution

A novel Meta-Hybrid Regression Ensemble (MHRE) approach for multi-trait crop prediction across varieties and regions.

## Key findings

- MHRE outperformed individual models like XGBoost and RF for predicting rice and cotton traits.
- MHRE achieved high accuracy for rice yield (R2 = 0.78) and cotton bolls per plant (R2 = 0.93).
- The model was validated across five rice and three cotton varietal groups in six Chinese regions.

## Abstract

Timely and accurate prediction of crop traits is critical for precision breeding and regional agricultural production. Previous studies have primarily focused on single crop yield traits, neglecting other crop traits and variety-specific analyses. To address this issue, we employed a Meta-Hybrid Regression Ensemble (MHRE) approach by using multiple machine learning (ML) approaches as base learners, integrating regional multi-year, multi-variety crop field trials with satellite remote sensing indices, meteorological and phenological data to predict major crop traits. Results demonstrated MHRE’s optimal performance for rice and cotton, significantly outperforming individual models (RF, XGBoost, CatBoost, and LightGBM). Specifically, for rice crop, MHRE achieved highest accuracy for yield trait (R2 = 0.78, RMSE = 0.59 t ha−1) compared to the best individual model (XGBoost: R2 = 0.76, RMSE = 0.61 t ha−1); traits like effective spike also showed strong predictability (R2 = 0.64, RMSE = 27.81 10,000·spike ha−1). Similarly, for cotton, MHRE substantially improved yield trait prediction (R2 = 0.82, RMSE = 0.33 t ha−1) compared to the best individual model (RF: R2 = 0.77, RMSE = 0.36 t ha−1); bolls per plant accuracy was highest (R2 = 0.93, RMSE = 2.27 bolls plant−1). Moreover, rigorous validation confirmed that crop-specific MHRE models are robust across five rice and three cotton varietal groups and are applicable across six distinct regions in China. Furthermore, we applied the SHAP (SHapley Additive exPlanations) method to analyze the growth stages and key environmental factors affecting major traits. Our study illustrates a practical framework for regional-scale crop traits prediction by fusing multi-source data and ensemble machine learning, offering new insights for precision agriculture and crop management.

## Full-text entities

- **Species:** Oryza sativa (Asian cultivated rice, species) [taxon 4530]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12845638/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12845638/full.md

## References

83 references — full list in the complete paper: https://tomesphere.com/paper/PMC12845638/full.md

---
Source: https://tomesphere.com/paper/PMC12845638