Continual Traffic Forecasting via Mixture of Experts
Sanghyun Lee, Chanyoung Park

TL;DR
This paper introduces TFMoE, a mixture of experts model designed for continual traffic forecasting that effectively handles evolving sensor networks without catastrophic forgetting.
Contribution
The paper proposes a novel mixture of experts framework for traffic forecasting that segments traffic into groups, enabling continual learning in evolving networks.
Findings
Outperforms existing methods on PEMSD3-Stream dataset
Demonstrates robustness against catastrophic forgetting
Achieves efficient adaptation to new sensors
Abstract
The real-world traffic networks undergo expansion through the installation of new sensors, implying that the traffic patterns continually evolve over time. Incrementally training a model on the newly added sensors would make the model forget the past knowledge, i.e., catastrophic forgetting, while retraining the model on the entire network to capture these changes is highly inefficient. To address these challenges, we propose a novel Traffic Forecasting Mixture of Experts (TFMoE) for traffic forecasting under evolving networks. The main idea is to segment the traffic flow into multiple homogeneous groups, and assign an expert model responsible for a specific group. This allows each expert model to concentrate on learning and adapting to a specific set of patterns, while minimizing interference between the experts during training, thereby preventing the dilution or replacement of prior…
Peer Reviews
Decision·Submitted to ICLR 2024
1. The authors decompose the problem in structured way 2. The paper is well written and easy to follow 3. The problem which this paper handles is very interesting
1. Experiments are only done in one dataset. It is better to extend the scope of experiments to validate the method. 2. While the components are well combined, most of them are existing techniques. 3. Lack of analysis. Only the ablation of components were conducted. The paper would benefit from the additional analysis.
S1. The paper's significance is underscored by its goal to address a real-world problem. S2. The TFMoE model's novelty is encapsulated in its innovative usage of Mixture-of-Experts, along with three complementary mechanisms, effectively addressing the unique challenges in continual traffic forecasting. S3. The evaluation is comprehensive, with comparisons to baseline methods providing a compelling demonstration of the superior performance of the TFMoE model. S4. The clarity of the manuscrip
W1. Although the paper provides a comprehensive explanation of the methodology, further technical insights regarding the implementation and specific algorithms within the TFMoE method would be beneficial. W2. The paper falls short in providing a detailed analysis of the limitations of the proposed TFMoE, a factor which could be significant for future research and practical applications. W3. The computational complexity of the TFMoE algorithm, especially in the phases of continual training and
1. It proposes a novel approach to address the challenges of catastrophic forgetting and inefficiency in traffic forecasting under evolving networks. The Traffic Forecasting Mixture of Experts (TFMoE) method segments traffic flow into multiple homogeneous groups and assigns expert models to specific patterns, achieving superior performance and resilience in long-term streaming network datasets. 2. The paper provides extensive experimental results on a real-world long-term streaming network data
1. The Structure of the paper could be improved - Section 4 seems to be a bit too long, while part of the experiments, especially the settings and principles have to be left in the appendix. As I am not an expert in this area, section 4 is a little hard to follow, many complex modules are introduced in this section which makes it easy to lose. 2. As far as I know, clustering seems to be a common technique in the field of machine learning, while seldom reference is about the utilization of cluste
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Management and Algorithms · Advanced Clustering Algorithms Research
MethodsSparse Evolutionary Training
