ShapeFormer: Shapelet Transformer for Multivariate Time Series Classification
Xuan-May Le, Ling Luo, Uwe Aickelin, Minh-Tuan Tran

TL;DR
ShapeFormer is a novel transformer-based model that captures both class-specific shapelet features and generic features for multivariate time series classification, improving accuracy especially on imbalanced or similar-pattern datasets.
Contribution
The paper introduces ShapeFormer, a dual-module transformer architecture that extracts class-specific shapelet features and generic features, enhancing multivariate time series classification performance.
Findings
Achieved highest accuracy on 30 UEA datasets.
Effectively captures class-specific and generic features.
Outperforms state-of-the-art methods.
Abstract
Multivariate time series classification (MTSC) has attracted significant research attention due to its diverse real-world applications. Recently, exploiting transformers for MTSC has achieved state-of-the-art performance. However, existing methods focus on generic features, providing a comprehensive understanding of data, but they ignore class-specific features crucial for learning the representative characteristics of each class. This leads to poor performance in the case of imbalanced datasets or datasets with similar overall patterns but differing in minor class-specific details. In this paper, we propose a novel Shapelet Transformer (ShapeFormer), which comprises class-specific and generic transformer modules to capture both of these features. In the class-specific module, we introduce the discovery method to extract the discriminative subsequences of each class (i.e. shapelets)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Label Smoothing · Adam · Absolute Position Encodings · Dropout
