From Canopy to Collision: A Hybrid Predictive Framework for Identifying Risk Factors in Tree-Involved Traffic Crashes
Abdul Azim, Ahmed Hossain, Soumyadip Maitra, Panick Kalambay

TL;DR
This study presents a hybrid analytical framework combining machine learning, SHAP explanations, and regression to identify key risk factors and interactions influencing injury severity in tree-involved traffic crashes.
Contribution
The paper introduces a comprehensive multi-step framework integrating ML, SHAP, and regression to analyze crash severity factors and their interactions.
Findings
Restraint non-use increases severe injury risk nearly threefold.
Vehicle age, speeding, and driver impairment significantly affect crash severity.
Interactions between lighting, vehicle age, speeding, and road surface amplify risk.
Abstract
Tree-involved crashes represent a critical subset of run-off-road (ROR) collisions, often resulting in fatal or severe injuries due to high-energy impacts. This study develops a comprehensive analytical framework to identify and quantify risk factors contributing to crash severity in tree-involved collisions using the Crash Report Sampling System (CRSS) database spanning 2020-2023. The modeling framework follows a multi-step process. First, a machine learning based classification model (CatBoost) identifies key factors associated with binary crash injury severity (KA: fatal or incapacitating injury versus BC: non-incapacitating or possible injury). Second, SHapley Additive exPlanations (SHAP) tool is used to quantify and visualize the marginal effects of top influential factors on crash severity. Third, a binary logistic regression model estimates factor effects and validates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
