# TimeMAE: Self-Supervised Representations of Time Series with Decoupled Masked Autoencoders

**Authors:** Mingyue Cheng, Xiaoyu Tao, Zhiding Liu, Qi Liu, Hao Zhang, Rujiao Zhang, Enhong Chen

arXiv: 2303.00320 · 2026-03-02

## TL;DR

TimeMAE introduces a novel self-supervised framework for time series that segments data into semantic units and employs decoupled autoencoding, leading to improved transferability and performance in label-scarce settings.

## Contribution

It proposes a new masked modeling approach for time series using semantic units and decoupled encoding, addressing semantic density and representation mismatch issues.

## Key findings

- Outperforms baselines on five datasets
- Effective in label-scarce scenarios
- Enhances transfer learning performance

## Abstract

Learning transferable representations from unlabeled time series is crucial for improving performance in data-scarce classification. Existing self-supervised methods often operate at the point level and rely on unidirectional encoding, leading to low semantic density and a mismatch between pre-training and downstream optimization. In this paper, we propose TimeMAE, a self-supervised framework that reformulates masked modeling for time series via semantic unit elevation and decoupled representation learning. Instead of modeling individual time steps, TimeMAE segments time series into non-overlapping sub-series to form semantically enriched units, enabling more informative masked reconstruction while reducing computational cost. To address the representation discrepancy introduced by masking, we design a decoupled masked autoencoder that separately encodes visible and masked regions, avoiding artificial masked tokens in the main encoder. To guide pre-training, we introduce two complementary objectives: masked codeword classification, which discretizes sub-series semantics via a learned tokenizer and masked representation regression, which aligns continuous representations through a momentum-updated target encoder. Extensive experiments on five datasets demonstrate that TimeMAE outperforms competitive baselines, particularly in label-scarce scenarios and transfer learning scenarios.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2303.00320/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/2303.00320/full.md

## References

65 references — full list in the complete paper: https://tomesphere.com/paper/2303.00320/full.md

---
Source: https://tomesphere.com/paper/2303.00320