Tetris: Efficient Intra-Datacenter Calls Packing for Large Conferencing Services
Rohan Gandhi, Ankur Mallick, Ken Sueda, Rui Liang

TL;DR
This paper introduces Tetris, a framework for optimizing call packing in datacenter servers to reduce CPU hotspots, improve performance, and lower costs in large conferencing services.
Contribution
Tetris is a novel multi-step framework that uses historical data and linear optimization to balance call loads across MPs, addressing variability and burstiness in call arrivals.
Findings
Reduces hot MP CPU utilization by at least 2.5X
Improves call packing efficiency in large datacenter traces
Decreases hosting costs and performance degradation risks
Abstract
Conference services like Zoom, Microsoft Teams, and Google Meet facilitate millions of daily calls, yet ensuring high performance at low costs remains a significant challenge. This paper revisits the problem of packing calls across Media Processor (MP) servers that host the calls within individual datacenters (DCs). We show that the algorithm used in Teams -- a large scale conferencing service as well as other state-of-art algorithms are prone to placing calls resulting in some of the MPs becoming hot (high CPU utilization) that leads to degraded performance and/or elevated hosting costs. The problem arises from disregarding the variability in CPU usage among calls, influenced by differences in participant numbers and media types (audio/video), compounded by bursty call arrivals. To tackle this, we propose Tetris, a multi-step framework which (a) optimizes initial call assignments by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Parallel Computing and Optimization Techniques · Software-Defined Networks and 5G
