Dissecting Service Mesh Overheads
Xiangfeng Zhu, Guozhen She, Bowen Xue, Yu Zhang, Yongsu Zhang, Xuan Kelvin Zou, Xiongchun Duan, Peng He, Arvind Krishnamurthy, Matthew Lentz, Danyang Zhuo, Ratul Mahajan

TL;DR
This paper introduces MeshInsight, a tool for systematically measuring service mesh overheads, revealing significant latency and resource impacts that depend on configuration and workload, and evaluating optimization effects.
Contribution
The paper presents MeshInsight, a novel decompositional approach and tool for quantifying service mesh overheads and analyzing the effects of configurations and optimizations.
Findings
Service meshes can increase latency by up to 185% for the paper's benchmark applications.
They can require up to 92% more virtual CPU cores for the same applications.
Overheads are highly dependent on configuration and workload.
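To make the percentages above concrete, here is a minimal sketch of how a relative overhead figure of this kind is usually computed: the increase over a no-mesh baseline, divided by that baseline. The function name and all input numbers are hypothetical, chosen only so the arithmetic lands on the headline figures. They are not measurements from the paper, and this is not MeshInsight's code.

```python
def relative_overhead(baseline: float, with_mesh: float) -> float:
    """Return the percentage increase of `with_mesh` over `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (with_mesh - baseline) / baseline * 100.0


# Hypothetical inputs (assumptions for illustration, not data from the paper).
baseline_latency_ms = 2.0   # request latency without a service mesh
mesh_latency_ms = 5.7       # request latency with a service mesh
baseline_vcpus = 1.0        # virtual CPU cores used without a mesh
mesh_vcpus = 1.92           # virtual CPU cores used with a mesh

latency_overhead = relative_overhead(baseline_latency_ms, mesh_latency_ms)
cpu_overhead = relative_overhead(baseline_vcpus, mesh_vcpus)

print(f"latency overhead: {latency_overhead:.0f}%")
print(f"CPU overhead: {cpu_overhead:.0f}%")
```

Read this way, "185% higher latency" means the meshed latency is 2.85 times the baseline, not 1.85 times.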
Abstract
Service meshes play a central role in the modern application ecosystem by providing an easy and flexible way to connect different services that form a distributed application. However, because of the way they interpose on application traffic, they can substantially increase application latency and resource consumption. We develop a decompositional approach and a tool, called MeshInsight, to systematically characterize the overhead of service meshes and to help developers quantify overhead in deployment scenarios of interest. Using MeshInsight, we confirm that service meshes can have high overhead -- up to 185% higher latency and up to 92% more virtual CPU cores for our benchmark applications -- but the severity is intimately tied to how they are configured and the application workload. The primary contributors to overhead vary based on the configuration too. IPC (inter-process…
Taxonomy
Topics: Cloud Computing and Resource Management · Software System Performance and Reliability · Software-Defined Networks and 5G
