Yield forecasting with machine learning and small data: what gains for grains?
Michele Meroni, Fran\c{c}ois Waldner, Lorenzo Seguini, Herv\'e, Kerdiles, Felix Rembold

TL;DR
This study evaluates machine learning for crop yield forecasting using limited data in Algeria, showing that well-calibrated simple models can perform comparably to complex algorithms in small data scenarios.
Contribution
It demonstrates a robust pipeline for small data crop yield forecasting and highlights the importance of model calibration over complexity.
Findings
Machine learning models outperform benchmarks in yield prediction.
Proper model calibration is crucial for small data scenarios.
Simple benchmark models can be competitive with complex models.
Abstract
Forecasting crop yields is important for food security, in particular to predict where crop production is likely to drop. Climate records and remotely-sensed data have become instrumental sources of data for crop yield forecasting systems. Similarly, machine learning methods are increasingly used to process big Earth observation data. However, access to data necessary to train such algorithms is often limited in food-insecure countries. Here, we evaluate the performance of machine learning algorithms and small data to forecast yield on a monthly basis between the start and the end of the growing season. To do so, we developed a robust and automated machine-learning pipeline which selects the best features and model for prediction. Taking Algeria as case study, we predicted national yields for barley, soft wheat and durum wheat with an accuracy of 0.16-0.2 t/ha (13-14 % of mean yield)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
