NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series   Pretraining

Chenguo Lin; Xumeng Wen; Wei Cao; Congrui Huang; Jiang Bian; Stephen; Lin; Zhirong Wu

arXiv:2310.07402·cs.LG·July 11, 2024

NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining

Chenguo Lin, Xumeng Wen, Wei Cao, Congrui Huang, Jiang Bian, Stephen, Lin, Zhirong Wu

PDF

Open Access 1 Repo

TL;DR

NuTime introduces a numerically multi-scaled embedding approach for large-scale time-series pretraining, enabling effective semantic representation learning on datasets with millions of sequences.

Contribution

It presents a novel embedding module tailored for numerical properties of time-series data, scaling pretraining to large datasets and improving transfer performance across tasks.

Findings

01

Achieves state-of-the-art results on multiple benchmarks.

02

Significantly outperforms previous pretraining methods.

03

Effective on both univariate and multivariate tasks.

Abstract

Recent research on time-series self-supervised models shows great promise in learning semantic representations. However, it has been limited to small-scale datasets, e.g., thousands of temporal sequences. In this work, we make key technical contributions that are tailored to the numerical properties of time-series data and allow the model to scale to large datasets, e.g., millions of temporal sequences. We adopt the Transformer architecture by first partitioning the input into non-overlapping windows. Each window is then characterized by its normalized shape and two scalar values denoting the mean and standard deviation within each window. To embed scalar values that may possess arbitrary numerical amplitudes in a high-dimensional space, we propose a numerically multi-scaled embedding module enumerating all possible numerical scales for the scalars. The model undergoes pretraining with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chenguolin/nutime
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Machine Learning in Healthcare · Gaussian Processes and Bayesian Inference

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Softmax · Byte Pair Encoding · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection