TIFO: Time-Invariant Frequency Operator for Stationarity-Aware Representation Learning in Time Series
Xihao Piao, Zheng Chen, Lingwei Zhu, Yushun Dong, Yasuko Matsubara, Yasushi Sakurai

TL;DR
TIFO introduces a frequency domain approach that learns stationary weights across datasets to improve nonstationary time series forecasting, effectively reducing distribution shift and computational costs.
Contribution
The paper proposes TIFO, a novel frequency operator that captures stationarity-aware features, enhancing forecasting accuracy and scalability in nonstationary time series.
Findings
Achieves 18 top-1 and 6 top-2 results out of 28 settings.
Yields 33.3% and 55.3% improvements in MSE on ETTm2.
Reduces computational costs by 60-70%.
Abstract
Nonstationary time series forecasting suffers from the distribution shift issue due to the different distributions that produce the training and test data. Existing methods attempt to alleviate the dependence by, e.g., removing low-order moments from each individual sample. These solutions fail to capture the underlying time-evolving structure across samples and do not model the complex time structure. In this paper, we aim to address the distribution shift in the frequency space by considering all possible time structures. To this end, we propose a Time-Invariant Frequency Operator (TIFO), which learns stationarity-aware weights over the frequency spectrum across the entire dataset. The weight representation highlights stationary frequency components while suppressing non-stationary ones, thereby mitigating the distribution shift issue in time series. To justify our method, we show…
Peer Reviews
Decision·Submitted to ICLR 2026
- This work focuses on the practical and important issue of *non-stationarity* in time-series forecasting, providing a frequency-domain perspective that complements conventional normalization approaches focused on low-order statistics.. - This work proposes a tightly integrated, plug-and-play pipeline: Stage-I computes frequency- and channel-wise stationarity $S(k,c)$; Stage-II maps SSS through two lightweight MLPs to produce real/imag frequency weights that reweight DFT coefficients before iD
(i) The conceptual novelty, while interesting, mainly lies in *reinterpreting normalization through frequency reweighting* rather than establishing a fundamentally new principle. The relation to prior spectral or stationarity-aware frameworks (e.g., FAN, FedFormer, FILM, FredFormer) is not deeply analyzed, leaving unclear how TIFO’s weighting differs from existing frequency-domain normalization or filtering approaches. (ii) The paper underexplores sensitivity of key design choices: the definiti
1. The idea of dataset-level spectral reweighting to favor stationary components is intuitive and can be attached to many backbones with low engineering overhead. 2. Reported improvements across many settings, plus substantial compute savings, suggest practical impact if substantiated with strong measurement methodology. 3. Framing distribution shift via frequency-domain structure is appealing. 4. The writing is generally clear.
1. Lines 188–189: It is unclear how the preceding analysis leads to the conclusion that stationary components should be enhanced while non-stationary components should be suppressed. The logical connection between the theoretical argument and this design choice should be elaborated. 2. Reducing the weights of non-stationary components may limit the model’s predictive upper bound, since important information carried by these components could be lost. Please discuss the potential trade-off betwee
(i) This paper introduces a dataset-level notion of frequency stability (mean/variance across samples) and uses it to reweight real/imag FFT components during training. (ii) The two-stage pipeline is easy to follow; the paper provides algorithm pseudocode and a clean description of how stability scores are computed and mapped to frequency weights. (iii) The operator is plug-and-play, requires minimal architectural change, and is computationally light, making it practical for adoption.
(i) The method assumes frequency bands that are stable on the training set remain so at test time. Abrupt shifts (new cycles, policy changes, outages) may violate this, and the paper offers no online update or detection mechanism. (ii) Reweighting real/imag parts independently ignores cross-channel phase structure and inter-series coherence; this could distort multivariate dynamics when phase relations carry signal. This paper do not (iii) Sensitivity to window length, FFT resolution, windowin
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Traffic Prediction and Management Techniques
