# Flood susceptibility assessment using three machine learning techniques and comparison of their performance

**Authors:** Tade Mule Asrade, Sintayehu Adefires Abebe, Kassahun Birhanu Tadesse, Mulu Sewinet Kerebih, Taye Minichil Meshesha

PMC · DOI: 10.1038/s41598-026-38391-0 · 2026-02-10

## TL;DR

This study compares three machine learning models for predicting flood susceptibility in the Choke Watershed, finding Gradient Boosting and Extreme Gradient Boosting to be the most accurate.

## Contribution

This is the first application of Gradient Boosting and Extreme Gradient Boosting models for flood susceptibility mapping in the Choke Watershed.

## Key findings

- Gradient Boosting and Extreme Gradient Boosting achieved the highest test accuracy of 0.97 for flood prediction.
- The study introduces a novel approach to flood susceptibility mapping using ensemble machine learning techniques.
- The results can be extended to other natural disaster risk assessments, such as earthquakes and landslides.

## Abstract

One of the most common natural disasters is flooding, which has the potential to seriously harm environments and infrastructure. Flood susceptibility mapping (FSM) is the main way to manage flood risk. It measures how likely a region is to flood in a quantitative way. The purpose of this study was to develop state-of-the-art ensemble machine learning (ML) models for flood prediction and to identify the most suitable approach for accurate flood susceptibility mapping. This study leverages diverse datasets, including elevation, slope, aspect, plan curvature, topographic wetness index, stream power index, distance from rivers, soil, rainfall, land use/land cover, and drainage density, which were used as conditioning factors to evaluate flood susceptibility in the Choke Watershed. Three machine learning (ML) algorithms were employed: Random Forest (RF), Gradient Boosting (GB), and Extreme Gradient Boosting (XGBoost). Model performance was assessed using confusion matrix metrics and the area under the receiver operating characteristic curve (AUROC). The Gradient Boosting (GB) and Extreme Gradient Boosting (XGBoost) models scored the highest in terms of test accuracy (0.97), followed by RF (0.96). This study is the first application of these models in the Choke Watershed for flood susceptibility mapping, with potential for broader applications to other natural disasters, including earthquakes and landslides. The results help strengthen global efforts aimed at mitigating natural disaster risks, particularly in Ethiopia, and advancing environmental sustainability.

## Full-text entities

- **Diseases:** Flood (MESH:C565009)

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12960810/full.md

---
Source: https://tomesphere.com/paper/PMC12960810