A windowed correlation based feature selection method to improve time series prediction of dengue fever cases
Tanvir Ferdousi, Lee W. Cohnstaedt, and Caterina M. Scoglio

TL;DR
This paper introduces a novel correlation-based feature selection framework that enhances dengue fever prediction models by incorporating spatially relevant data from nearby locations, significantly improving accuracy.
Contribution
The work presents a new windowing and correlation-based method for selecting spatial features, improving dengue prediction accuracy with limited data.
Findings
Up to 33.6% accuracy improvement using the proposed method.
Achieved mean absolute error as low as 0.128 on normalized data.
Windowing techniques perform comparably, with outbreak detection being more data-efficient.
Abstract
The performance of data-driven prediction models depends on the availability of data samples for model training. A model that learns about dengue fever incidence in a population uses historical data from that corresponding location. Poor performance in prediction can result in places with inadequate data. This work aims to enhance temporally limited dengue case data by methodological addition of epidemically relevant data from nearby locations as predictors (features). A novel framework is presented for windowing incidence data and computing time-shifted correlation-based metrics to quantify feature relevance. The framework ranks incidence data of adjacent locations around a target location by combining the correlation metric with two other metrics: spatial distance and local prevalence. Recurrent neural network-based prediction models achieve up to 33.6% accuracy improvement on average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
