Predicting Terrorist Attacks in the United States using Localized News Data
Steven J. Krieg, Christian W. Smith, Rusha Chatterjee, Nitesh V., Chawla

TL;DR
This paper develops machine learning models, especially a Random Forest with a novel feature representation, to predict terrorist attacks in US states using localized news data, demonstrating promising results and highlighting the importance of location-specific modeling.
Contribution
Introduces a novel variable-length moving average feature representation and demonstrates its effectiveness in predicting terrorist attacks with localized models and machine learning techniques.
Findings
Random Forest outperforms deep models on noisy, imbalanced data
Localized models improve prediction accuracy
Model predictions are robust to attack timing and characteristics
Abstract
Terrorism is a major problem worldwide, causing thousands of fatalities and billions of dollars in damage every year. Toward the end of better understanding and mitigating these attacks, we present a set of machine learning models that learn from localized news data in order to predict whether a terrorist attack will occur on a given calendar date and in a given state. The best model--a Random Forest that learns from a novel variable-length moving average representation of the feature space--achieves area under the receiver operating characteristic scores on four of the five states that were impacted most by terrorism between 2015 and 2018. Our key findings include that modeling terrorism as a set of independent events, rather than as a continuous process, is a fruitful approach--especially when the events are sparse and dissimilar. Additionally, our results highlight the need…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
