TL;DR
This paper introduces SSDA, a dual-adaptation approach that bridges spectral and structural gaps in vision-based time series forecasting, significantly improving performance over existing large vision models.
Contribution
The paper proposes a novel dual-branch network with spectral and structural adaptation mechanisms to enhance vision models for time series forecasting.
Findings
SSDA outperforms baseline models on seven real-world benchmarks.
Spectral Magnitude Aligner improves natural image spectrum alignment.
Structural-Guided Low-Rank Adaptation enhances temporal encoding and attention.
Abstract
Large vision models (LVMs) have recently proven to be surprisingly effective time series forecasters, simply by rendering temporal data as images. This success, how ever, rests on a largely unexamined premise: the rendered time series images are sufficiently close to natural images for knowledge in pre-trained models to transfer effectively. We argue that two gaps still remain, i.e., spectral and structural gaps, fundamentally limiting the potential of LVMs for time series forecasting. Spectrally, we systematically reveal that rendered time series images exhibit a markedly shallower power spectrum than the natural images LVMs are pre-trained to recognize. Structurally, reshaping 1D temporal sequences into 2D grids fabricates spurious spatial adjacencies while severing genuine temporal continuities, misleading the spatial inductive biases of pre-trained LVMs. To bridge these gaps, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
