# Revisiting Pearl’s influenza studies by bootstrapping for forward variable selection with a null factor

**Authors:** Roselinde Kessels, Chris Gotwalt, Guido Erreygers

PMC · DOI: 10.1371/journal.pone.0318685 · PLOS One · 2025-02-25

## TL;DR

This paper revisits historical influenza studies using modern statistical methods to better understand factors influencing pandemic severity.

## Contribution

The paper demonstrates how modern variable selection techniques can enhance historical epidemiological analysis.

## Key findings

- Pre-pandemic death rates from heart disease and all causes predict pandemic severity.
- Bootstrap simulations with a null factor refine model selection in historical data analysis.
- Modern statistical methods confirm and add nuance to Pearl's original conclusions.

## Abstract

In 1919 and 1921 Raymond Pearl published four empirical studies on the Spanish Flu epidemic in which he explored the factors that might explain the explosiveness and destructiveness of the epidemic in America’s largest cities. Using partial correlation coefficients he tried to isolate the net effects of the possible explanatory factors, such as general demographic characteristics of the cities and death rates for various diseases, on the variables measuring the severity of the epidemic. Instead of Pearl’s correlation analysis, we apply a bootstrap simulation to forward variable selection with a null factor for generalized linear regression with AICc validation. The null factor or pseudo-variable is a random variable that is independent of the response. The number of times it is included in the model selection simulation provides an important metric for deciding which terms should remain in the model. Our results are largely consistent with Pearl’s conclusions in that the pre-pandemic death rates from organic heart disease and from all causes are most predictive of pandemic explosiveness or severity. However, our results also contain substantive nuances. Our paper contributes to the literature showing that state-of-the-art methodology for variable selection proves useful for historical epidemiology.

## Full-text entities

- **Diseases:** heart disease (MESH:D006331), death (MESH:D003643), Flu (MESH:D007251)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11856291/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11856291/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC11856291/full.md

---
Source: https://tomesphere.com/paper/PMC11856291