Identification of Spikes in Time Series

Dana E. Goin; Jennifer Ahern

arXiv:1801.08061·stat.AP·January 25, 2018

Identification of Spikes in Time Series

Dana E. Goin, Jennifer Ahern

PDF

TL;DR

This study evaluates various spike detection methods in time series data through simulations and real-world violence rate analysis, finding Kalman filtering and smoothing to be the most effective.

Contribution

It systematically compares multiple spike detection techniques in a simulation setting and applies the best method to real-world data.

Findings

01

Kalman filtering and smoothing outperformed other methods in sensitivity and specificity.

02

The best method successfully identified spikes in violence rates across California cities.

03

Simulation results guide practical spike detection in social science time series.

Abstract

Identification of unexpectedly high values in a time series is useful for epidemiologists, economists, and other social scientists interested in the effect of an exposure spike on an outcome variable. However, the best method to identify spikes in time series is not known. This paper aims to fill this gap by testing the performance of several spike detection methods in a simulation setting. We created simulations parameterized by monthly violence rates in nine California cities that represented different series features, and randomly inserted spikes into the series. We then compared the ability to detect spikes of the following methods: ARIMA modeling, Kalman filtering and smoothing, wavelet modeling with soft thresholding, and an iterative outlier detection method. We varied the magnitude of spikes from 10-50% of the mean rate over the study period and varied the number of spikes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.