Towards Fair and Efficient Learning-based Congestion Control
Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen

TL;DR
Astraea is a novel learning-based congestion control framework using multi-agent deep reinforcement learning to achieve fair, stable, and fast convergence among competing flows, outperforming existing solutions.
Contribution
Introduces Astraea, a multi-agent deep reinforcement learning approach that explicitly optimizes fairness, convergence speed, and stability in congestion control.
Findings
Achieves near-optimal fairness in bandwidth sharing.
Converges up to 8.4 times faster than prior methods.
Exhibits 2.8 times smaller throughput deviation.
Abstract
Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. However, they fail to provide consistently good convergence properties, including {\em fairness}, {\em fast convergence} and {\em stability}, due to the mismatch between their objective functions and these properties. Despite being intuitive, integrating these properties into existing learning-based CC is challenging, because: 1) their training environments are designed for the performance optimization of single flow but incapable of cooperative multi-flow optimization, and 2) there is no directly measurable metric to represent these properties into the training objective function. We present Astraea, a new learning-based congestion control that ensures fast convergence to fairness with stability. At the heart of Astraea is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Traffic and Congestion Control
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
