TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation

Daoyu Wang; Mingyue Cheng; Zhiding Liu; Qi Liu

arXiv:2410.05711·cs.LG·June 12, 2025·2 cites

TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation

Daoyu Wang, Mingyue Cheng, Zhiding Liu, Qi Liu

PDF

Open Access 3 Repos 1 Video

TL;DR

TimeDART introduces a self-supervised framework combining a causal Transformer and denoising diffusion to effectively learn both global trends and local patterns in time series data, improving downstream forecasting and classification tasks.

Contribution

It unifies Transformer-based global modeling with diffusion-based local pattern capturing in a self-supervised pre-training framework for time series.

Findings

01

Outperforms previous methods on forecasting tasks

02

Achieves superior classification accuracy

03

Effectively captures both long-term and local patterns

Abstract

Self-supervised learning has garnered increasing attention in time series analysis for benefiting various downstream tasks and reducing reliance on labeled data. Despite its effectiveness, existing methods often struggle to comprehensively capture both long-term dynamic evolution and subtle local patterns in a unified manner. In this work, we propose \textbf{TimeDART}, a novel self-supervised time series pre-training framework that unifies two powerful generative paradigms to learn more transferable representations. Specifically, we first employ a causal Transformer encoder, accompanied by a patch-based embedding strategy, to model the evolving trends from left to right. Building on this global modeling, we further introduce a denoising diffusion process to capture fine-grained local patterns through forward diffusion and reverse denoising. Finally, we optimize the model in an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation· slideslive

Taxonomy

TopicsNeural Networks and Applications · Time Series Analysis and Forecasting

MethodsDense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings