Parsimony or Capability? Decomposition Delivers Both in Long-term Time Series Forecasting
Jinliang Deng, Feiyang Ye, Du Yin, Xuan Song, Ivor W. Tsang, Hui Xiong

TL;DR
This paper shows that decomposition techniques enable long-term time series forecasting models to be both simpler and more effective, outperforming complex models with far fewer parameters by leveraging data's intrinsic dynamics.
Contribution
The study introduces a decomposition-based approach that reduces model complexity while maintaining or improving forecasting accuracy in long-term time series prediction.
Findings
Model outperforms benchmarks across datasets
Uses over 99% fewer parameters than competitors
Decomposition effectively controls model inflation
Abstract
Long-term time series forecasting (LTSF) represents a critical frontier in time series analysis, characterized by extensive input sequences, as opposed to the shorter spans typical of traditional approaches. While longer sequences inherently offer richer information for enhanced predictive precision, prevailing studies often respond by escalating model complexity. These intricate models can inflate into millions of parameters, resulting in prohibitive parameter scales. Our study demonstrates, through both analytical and empirical evidence, that decomposition is key to containing excessive model inflation while achieving uniformly superior and robust results across various datasets. Remarkably, by tailoring decomposition to the intrinsic dynamics of time series data, our proposed model outperforms existing benchmarks, using over 99 \% fewer parameters than the majority of competing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Forecasting Techniques and Applications
MethodsSparse Evolutionary Training · Multi-Head Attention · Attention Is All You Need · Absolute Position Encodings · Layer Normalization · Label Smoothing · Residual Connection · Dropout · Linear Layer · Byte Pair Encoding
