OSTSC: Over Sampling for Time Series Classification in R
Matthew Dixon, Diego Klabjan, Lan Wei

TL;DR
OSTSC is an R package that enhances time series classification by oversampling imbalanced data, significantly improving RNN classifier performance, especially in high-frequency trading datasets.
Contribution
This paper introduces OSTSC, a novel oversampling method for univariate, multinomial time series data in R, with demonstrated scalability and performance improvements.
Findings
OSTSC increases LSTM AUC from 0.543 to 0.784 on trading data.
The package improves classifier performance on imbalanced datasets.
OSTSC scales effectively to larger datasets.
Abstract
The OSTSC package is a powerful oversampling approach for classifying univariant, but multinomial time series data in R. This article provides a brief overview of the oversampling methodology implemented by the package. A tutorial of the OSTSC package is provided. We begin by providing three test cases for the user to quickly validate the functionality in the package. To demonstrate the performance impact of OSTSC, we then provide two medium size imbalanced time series datasets. Each example applies a TensorFlow implementation of a Long Short-Term Memory (LSTM) classifier - a type of a Recurrent Neural Network (RNN) classifier - to imbalanced time series. The classifier performance is compared with and without oversampling. Finally, larger versions of these two datasets are evaluated to demonstrate the scalability of the package. The examples demonstrate that the OSTSC package improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Data Stream Mining Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
