Confucius: Achieving Consistent Low Latency with Practical Queue Management for Real-Time Communications
Zili Meng, Nirav Atre, Mingwei Xu, Justine Sherry, Maria Apostolaki

TL;DR
Confucius is a practical queue management scheme that maintains low latency for real-time communications by aligning bandwidth adjustments with congestion control, without requiring end-host cooperation or manual tuning.
Contribution
It introduces a queue management method that reduces latency spikes and improves fairness without end-host collaboration or manual parameter tuning.
Findings
Reduces stall duration by over 50% compared to existing schemes.
Maintains low latency for real-time traffic despite competing Web flows.
Ensures fair performance for competing flows.
Abstract
Real-time communication applications require consistently low latency, which is often disrupted by latency spikes caused by competing flows, especially Web traffic. We identify the root cause of disruptions in such cases as the mismatch between the abrupt bandwidth allocation adjustment of queue scheduling and gradual congestion window adjustment of congestion control. For example, when a sudden burst of new Web flows arrives, queue schedulers abruptly shift bandwidth away from the existing real-time flow(s). The real-time flow will need several RTTs to converge to the new available bandwidth, during which severe stalls occur. In this paper, we present Confucius, a practical queue management scheme designed for offering real-time traffic with consistently low latency regardless of competing flows. Confucius slows down bandwidth adjustment to match the reaction of congestion control,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Timetabling Solutions · Advanced Queuing Theory Analysis
