Learning to Embed Time Series Patches Independently
Seunghan Lee, Taeyoung Park, Kibok Lee

TL;DR
This paper proposes a novel approach for time series representation learning that independently embeds patches using autoencoding and contrastive learning, outperforming existing Transformer-based methods in forecasting and classification tasks.
Contribution
It introduces a simple patch-wise embedding method with independent autoencoding and contrastive learning, challenging the dependency-based patch modeling in prior work.
Findings
Improves time series forecasting accuracy
Enhances classification performance
Reduces model complexity and training time
Abstract
Masked time series modeling has recently gained much attention as a self-supervised representation learning strategy for time series. Inspired by masked image modeling in computer vision, recent works first patchify and partially mask out time series, and then train Transformers to capture the dependencies between patches by predicting masked patches from unmasked patches. However, we argue that capturing such patch dependencies might not be an optimal strategy for time series representation learning; rather, learning to embed patches independently results in better time series representations. Specifically, we propose to use 1) the simple patch reconstruction task, which autoencode each patch without looking at other patches, and 2) the simple patch-wise MLP that embeds each patch independently. In addition, we introduce complementary contrastive learning to hierarchically capture…
Peer Reviews
Decision·ICLR 2024 poster
- Extensive experiments comparing to other methods and ablating different components. - Incremental but shown to improve approach with simple mechanisms. - Interesting analysis on distribution shift
- Novelty is weak, their contribution auto encoding is quite established before masked modelling literature and one of the early approaches to representation learning. It is more that exploring this within time-series data which seems to be there contribution. Also mixing of CL and masked modelling has been explored in other methods but not exactly similar to their approach. - Missing reference to previous method that Combine CL and MAE but in a different context and different method: Gong, Yu
1. The paper is generally well-written and easy to follow. The method seems straightforward to implement. 2. The experiments are thorough. The proposed method is evaluated on two tasks with a total of 12 datasets. Also, the transfer learning setting is explored. The key PI vs. PD task is analyzed through quantitative and qualitative experiments.
1. The training and inference efficiency analyses are brief or missing. It would be useful to see whether the patch-independent design can also bring benefits to inference time. The model requires contrastive learning and reconstruction loss, which might drastically increase training time compared to other supervised learning methods. Therefore it would be useful to see it compared to supervised learning methods as well.
1) Originality: The paper introduces a novel approach to self-supervised time series representation learning, with a focus on patch independence. This represents a unique perspective challenging the traditional methods of capturing patch dependencies. Additionally, it is the first model to integrate MTM and CL, showcasing the innovation of the implementation while improving model performance. 2) Quality: The paper is well-written and provides a comprehensive analysis of the proposed method. The
A substantive assessment of the weaknesses of the paper. Focus on constructive and actionable insights on how the work could improve towards its stated goals. Be specific, avoid generic remarks. For example, if you believe the contribution lacks novelty, provide references and an explanation as evidence: if you believe experiments are insufficient, explain why and exactly what is missing, etc. 1. The representation in Figure 1(a) should be clearer and correspond one-to-one with the description i
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications
MethodsContrastive Learning
