Wavelet-Based Hybrid Machine Learning Model for Out-of-distribution Internet Traffic Prediction
Sajal Saha, Anwar Haque, and Greg Sidebottom

TL;DR
This paper presents a wavelet-based hybrid machine learning model that enhances internet traffic prediction, especially in out-of-distribution scenarios, by improving generalization over traditional models.
Contribution
It introduces a novel hybrid model combining wavelet decomposition with ensemble machine learning techniques to better handle distribution shifts in internet traffic data.
Findings
Hybrid model reduces performance gap in out-of-distribution tests.
Ensemble models achieve up to 96.4% accuracy on in-distribution data.
Wavelet decomposition improves out-of-distribution generalization.
Abstract
Efficient prediction of internet traffic is essential for ensuring proactive management of computer networks. Nowadays, machine learning approaches show promising performance in modeling real-world complex traffic. However, most existing works assumed that model training and evaluation data came from identical distribution. But in practice, there is a high probability that the model will deal with data from a slightly or entirely unknown distribution in the deployment phase. This paper investigated and evaluated machine learning performances using eXtreme Gradient Boosting, Light Gradient Boosting Machine, Stochastic Gradient Descent, Gradient Boosting Regressor, CatBoost Regressor, and their stacked ensemble model using data from both identical and out-of distribution. Also, we proposed a hybrid machine learning model integrating wavelet decomposition for improving out-of-distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Network Traffic and Congestion Control · Internet Traffic Analysis and Secure E-voting
