# A quantitative geospatial analysis of the risk that Boko Haram will target a school

**Authors:** Lirika Sola, Youdinghuan Chen, V. S. Subrahmanian, Steve Zimmerman, Jessica Leight, Jessica Leight

PMC · DOI: 10.1371/journal.pone.0320939 · 2025-06-17

## TL;DR

This paper uses data and machine learning to predict the risk of Boko Haram attacking schools in Nigeria based on security, activity, and socioeconomic factors.

## Contribution

The paper introduces a novel geospatial dataset and machine learning models to predict Boko Haram school attack risks.

## Key findings

- Security presence, Boko Haram activity, and socioeconomic factors are key predictors of school attacks.
- Machine learning models can accurately quantify the likelihood of a school being targeted.
- Decision trees reveal specific conditions that increase the risk of attacks.

## Abstract

We provide a novel quantitative geospatial analysis of school attacks perpetrated by Boko Haram in Nigeria. Such attacks are used by Boko Haram to kidnap boys (for potential use as child soldiers and suicide bombers) and girls (for potential use as domestic servants, as sex slaves, and suicide bombers). We first build a novel geospatially tagged data set spanning almost 15 years (July 2009 to April 2023) of data not only on Boko Haram attacks on schools (our dependent variable) but also a set of 15 independent variables (or features) about other attacks by Boko Haram, locations of security installations, as well as socioeconomic and geospatial characteristics of the regions around these schools. Second, we develop a univariate statistical analysis of this data, showing strong links between three broad factors affecting attacks on schools: Security presence in and around a school, the Boko Haram Activity in the area around a school, and the Socioeconomic characteristics of the region around a school. Third, we train several predictive machine learning models and assess their predictive efficacy. The results show that some of these models can accurately quantify the likelihood that a school will be at risk of a Boko Haram attack. In addition, they cast light on the features that are most important in making such predictions. We then analyze learned decision trees to identify some conditions on the independent variables that help predict Boko Haram attacks on school. Fourth, we use these decision trees to formulate multivariate hypotheses that we investigate further from a statistical perspective. We find that Security presence near schools, Activity of Boko Haram in regions, and the Socioeconomic factors characterizing the region a school is in are all significant predictors of attacks. We conclude with a policy recommendation.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

50 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12173403/full.md

---
Source: https://tomesphere.com/paper/PMC12173403