Time Series Data Augmentation as an Imbalanced Learning Problem

Vitor Cerqueira; Nuno Moniz; Ricardo Inácio; Carlos Soares

arXiv:2404.18537 · cs.LG · April 30, 2024

TL;DR

This paper introduces a novel data augmentation method for univariate time series, framing the problem as an imbalanced learning task to generate synthetic samples and improve forecasting accuracy.

Contribution

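It presents a new oversampling-based approach for generating synthetic time series data. The approach targets two weaknesses of global forecasting models: their need for large amounts of data, and their occasional failure to capture patterns unique to a particular time series.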

Findings

01. Outperforms global and local models in accuracy
02. Effective across 7 diverse datasets with 5502 time series
03. Improves model performance by addressing data imbalance

Abstract

Recent state-of-the-art forecasting methods are trained on collections of time series. These methods, often referred to as global models, can capture common patterns in different time series to improve their generalization performance. However, they require large amounts of data that might not be readily available. Besides this, global models sometimes fail to capture relevant patterns unique to a particular time series. In these cases, data augmentation can be useful to increase the sample size of time series datasets. The main contribution of this work is a novel method for generating univariate time series synthetic samples. Our approach stems from the insight that the observations concerning a particular time series of interest represent only a small fraction of all observations. In this context, we frame the problem of training a forecasting model as an imbalanced learning task.…
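To make the imbalanced-learning framing concrete, here is a minimal sketch, assuming a generic SMOTE-style interpolation rather than the authors' actual procedure (which the truncated abstract does not specify). The lag-embedded rows of the series of interest are treated as the minority class and oversampled until they match the rows from all other series. The helper names `lag_embed` and `smote_like_oversample`, the lag count, and the random-walk toy data are all hypothetical choices made for illustration.

```python
import numpy as np


def lag_embed(series, n_lags):
    """Turn a 1-D series into rows of n_lags inputs followed by 1 target value."""
    series = np.asarray(series, dtype=float)
    return np.array(
        [series[i:i + n_lags + 1] for i in range(len(series) - n_lags)]
    )


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between a randomly chosen
    minority row and one of its k nearest minority neighbours (SMOTE-style).
    Requires at least two minority rows."""
    rng = np.random.default_rng(seed)
    k = min(k, len(minority) - 1)
    # Pairwise Euclidean distances between minority rows.
    dists = np.linalg.norm(minority[:, None, :] - minority[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a row is never its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]
    base = rng.integers(0, len(minority), size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))  # interpolation weight in [0, 1)
    return minority[base] + gap * (minority[picked] - minority[base])


# Toy collection of 5 random-walk series (illustrative data only).
rng = np.random.default_rng(42)
collection = [np.cumsum(rng.normal(size=60)) for _ in range(5)]
n_lags = 4
target_idx = 0  # the "time series of interest"

# Minority class: rows from the target series. Majority: rows from the rest.
target_rows = lag_embed(collection[target_idx], n_lags)
other_rows = np.vstack(
    [lag_embed(s, n_lags) for i, s in enumerate(collection) if i != target_idx]
)

# Oversample the target series until both classes have the same row count.
synthetic = smote_like_oversample(
    target_rows, n_new=len(other_rows) - len(target_rows)
)

# Augmented training set for a global forecasting model.
augmented = np.vstack([other_rows, target_rows, synthetic])
X, y = augmented[:, :-1], augmented[:, -1]
```

Any regressor can then be fit on `X` and `y`. Interpolating whole rows (lags and target together) keeps each synthetic sample a convex combination of two real windows from the target series, which is one simple way to increase that series' weight in the training set.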

Code & Models

Repositories

vcerqueira/tser (Official)

Taxonomy

Topics: Imbalanced Data Classification Techniques