Non-Linear Determinants of Pedestrian Injury Severity: Evidence from Administrative Data in Great Britain
Yifei Tong

TL;DR
This study uses advanced machine learning techniques on British pedestrian collision data to identify key factors influencing injury severity, highlighting spatial and contextual variations to inform targeted safety interventions.
Contribution
It introduces a robust preprocessing pipeline and applies non-parametric ensemble models with interpretability tools to analyze complex injury severity determinants.
Findings
Vehicle count, speed limits, lighting, and road surface are key predictors.
Police attendance and junction features influence severity levels.
Rural areas show higher severity conditional on collision occurrence.
Abstract
This study investigates the non-linear determinants of pedestrian injury severity using administrative data from Great Britain's 2023 STATS19 dataset. To address inherent data-quality challenges, including missing information and substantial class imbalance, we employ a rigorous preprocessing pipeline utilizing mode imputation and Synthetic Minority Over-sampling (SMOTE). We utilize non-parametric ensemble methods (Random Forest and XGBoost) to capture complex interactions and heterogeneity often missed by linear models, while Shapley Additive Explanations are employed to ensure interpretability and isolate marginal feature effects. Our analysis reveals that vehicle count, speed limits, lighting, and road surface conditions are the primary predictors of severity, with police attendance and junction characteristics further distinguishing severe collisions. Spatially, while pedestrian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic and Road Safety · Automotive and Human Injury Biomechanics · Injury Epidemiology and Prevention
