# Predicting dominant terrestrial biomes at a global scale using machine learning algorithms, climate variable indices, and extreme event indices

**Authors:** Hisashi Sato, Krishna Vadrevu, Chong Xu, Chong Xu, Chong Xu

PMC · DOI: 10.1371/journal.pone.0324107 · PLOS One · 2026-02-26

## TL;DR

This study uses machine learning to predict global terrestrial biomes, finding that extreme climate data slightly improves accuracy but reduces model robustness.

## Contribution

The study evaluates the impact of machine learning algorithms and extreme climate indices on global biome modeling accuracy and robustness.

## Key findings

- Random forest and CNN achieved the highest accuracy, with CNN being more robust to overfitting.
- Adding extreme climate indices increased accuracy by less than 2% but reduced model robustness.
- Summarizing climate data into indices slightly reduced model accuracy.

## Abstract

Understanding the global distribution of biomes is essential for biodiversity conservation, climate modeling, and land-use planning. Traditional approaches often summarize climate data into indices, and recent models sometimes include extreme events such as severe droughts or rare cold spells. This study evaluates how the choice of machine learning algorithm, climate data summarization, and extreme climate indices affect the accuracy and robustness of global biome modeling. Four algorithms were tested: random forest (RF), support vector machine (SVM), naive Bayes (NV), and LeNet convolutional neural network (CNN). RF and CNN achieved the highest accuracy, with CNN preferred due to RF’s stronger overfitting. Summarizing climate data into indices reduced accuracy by 1–2%, while adding extreme indices increased accuracy by <2% (except for NV, which performed poorly overall). However, extreme climate data caused large mismatches between observed and predicted climate values, reducing robustness as measured by prediction consistency. These results indicate that including extreme climate data in global biome prediction models offers limited accuracy gains but can significantly weaken robustness, so caution is advised.

## Full-text entities

- **Diseases:** Death (MESH:D003643), ArCS II (MESH:C537944)
- **Chemicals:** CO2 (MESH:D002245), CEI (-), water (MESH:D014867), carbon (MESH:D002244)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944746/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944746/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944746/full.md

---
Source: https://tomesphere.com/paper/PMC12944746