FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts
Yiji Zhao, Zihao Zhong, Ao Wang, Haomin Wen, Ming Jin, Yuxuan Liang, Huaiyu Wan, Hao Wu

TL;DR
FaST introduces a scalable, efficient framework for long-horizon spatial-temporal graph forecasting using heterogeneity-aware Mixture-of-Experts, enabling accurate predictions over large networks with reduced computational costs.
Contribution
The paper proposes a novel heterogeneity-aware Mixture-of-Experts framework with adaptive graph attention and parallel MoE modules for scalable long-horizon STG forecasting.
Findings
Outperforms state-of-the-art models in accuracy for long-horizon predictions.
Achieves significant reductions in computational time and memory usage.
Successfully predicts one-week-ahead traffic patterns on large-scale graphs.
Abstract
Spatial-Temporal Graph (STG) forecasting on large-scale networks has garnered significant attention. However, existing models predominantly focus on short-horizon predictions and suffer from notorious computational costs and memory consumption when scaling to long-horizon predictions and large graphs. Targeting the above challenges, we present FaST, an effective and efficient framework based on heterogeneity-aware Mixture-of-Experts (MoEs) for long-horizon and large-scale STG forecasting, which unlocks one-week-ahead (672 steps at a 15-minute granularity) prediction with thousands of nodes. FaST is underpinned by two key innovations. First, an adaptive graph agent attention mechanism is proposed to alleviate the computational burden inherent in conventional graph convolution and self-attention modules when applied to large-scale graphs. Second, we propose a new parallel MoE module that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Advanced Graph Neural Networks · Human Mobility and Location-Based Analysis
