Learning-Based vs Human-Derived Congestion Control: An In-Depth Experimental Study
Mihai Mazilu, Luca Giacomoni, George Parisis

TL;DR
This study systematically compares learning-based congestion control algorithms with traditional methods like TCP Cubic and BBR, revealing their strengths and limitations through large-scale experiments.
Contribution
It provides a reproducible methodology for evaluating learning-based CC, highlighting challenges, and offering insights into their performance and fairness properties.
Findings
Embedding fairness into reward functions improves fairness but does not generalize.
RL approaches can acquire all available bandwidth with low latency.
Existing learning-based CC under-performs with dynamic bandwidth and latency changes.
Abstract
Learning-based congestion control (CC), including Reinforcement-Learning, promises efficient CC in a fast-changing networking landscape, where evolving communication technologies, applications and traffic workloads pose severe challenges to human-derived, static CC algorithms. Learning-based CC is in its early days and substantial research is required to understand existing limitations, identify research challenges and, eventually, yield deployable solutions for real-world networks. In this paper, we extend our prior work and present a reproducible and systematic study of learning-based CC with the aim to highlight strengths and uncover fundamental limitations of the state-of-the-art. We directly contrast said approaches with widely deployed, human-derived CC algorithms, namely TCP Cubic and BBR (version 3). We identify challenges in evaluating learning-based CC, establish a methodology…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
