Federated Foundation Models on Heterogeneous Time Series
Shengchao Chen, Guodong Long, Jing Jiang, Chengqi Zhang

TL;DR
This paper introduces FFTS, a federated learning framework for training general-purpose time series foundation models across heterogeneous datasets, enhancing cross-domain generalization for tasks like forecasting, imputation, and anomaly detection.
Contribution
The paper proposes a novel federated learning approach with regularization to effectively handle heterogeneity in time series data, improving model generalization across domains.
Findings
Superior performance on benchmark datasets
Effective handling of domain heterogeneity
Enhanced cross-domain task accuracy
Abstract
Training a general-purpose time series foundation models with robust generalization capabilities across diverse applications from scratch is still an open challenge. Efforts are primarily focused on fusing cross-domain time series datasets to extract shared subsequences as tokens for training models on Transformer architecture. However, due to significant statistical heterogeneity across domains, this cross-domain fusing approach doesn't work effectively as the same as fusing texts and images. To tackle this challenge, this paper proposes a novel federated learning approach to address the heterogeneity in time series foundation models training, namely FFTS. Specifically, each data-holding organization is treated as an independent client in a collaborative learning framework with federated settings, and then many client-specific local models will be trained to preserve the unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing
