On the Role of Time Series Clustering in Traffic Matrix Prediction
Martha Cash, Charlotte Fowler, Alexander M. Wyglinski

TL;DR
This paper investigates how clustering traffic flows based on different representations can improve traffic matrix prediction accuracy and efficiency.
Contribution
It introduces a clustering-based framework for traffic prediction, analyzing the impact of various representations and cluster counts on performance.
Findings
Clustering improves prediction over global models.
Most gains occur at moderate cluster numbers.
Different representations yield similar RMSE despite different partitions.
Abstract
This paper analyzes the role of time-series clustering in traffic matrix (TM) prediction. Traffic flows within a TM often exhibit heterogeneous behavior, which can reduce the effectiveness of global forecasting models that predict all flows jointly. To address this, we propose a clustering-based prediction framework that groups flows into smaller subsets and trains separate predictors for each group. Four traffic-flow representations for clustering are explored, namely, histogram, autocorrelation function (ACF), power spectral density (PSD), and na\"ive partitioning, and how the representation choice and the number of clusters affect prediction performance. Experiments using the publicly available Abilene and G\'EANT datasets show that clustering consistently improves over global forecasting baselines, while remaining substantially less costly than local prediction. The results further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
