Enhancing Transformer Efficiency for Multivariate Time Series Classification
Yuqing Wang, Yun Zhao, Linda Petzold

TL;DR
This paper introduces a module-wise pruning and Pareto analysis approach to improve the efficiency of Transformer models for multivariate time series classification, balancing accuracy with computational costs.
Contribution
It presents a novel methodology combining pruning and Pareto analysis to optimize Transformer efficiency specifically for large-scale multivariate time series classification.
Findings
Effective reduction in training time and memory usage.
Maintains high classification accuracy after pruning.
Demonstrated on benchmark datasets.
Abstract
Most current multivariate time series (MTS) classification algorithms focus on improving the predictive accuracy. However, for large-scale (either high-dimensional or long-sequential) time series (TS) datasets, there is an additional consideration: to design an efficient network architecture to reduce computational costs such as training time and memory footprint. In this work we propose a methodology based on module-wise pruning and Pareto analysis to investigate the relationship between model efficiency and accuracy, as well as its complexity. Comprehensive experiments on benchmark MTS datasets illustrate the effectiveness of our method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Advanced Chemical Sensor Technologies
MethodsPruning · Matching The Statements
