Towards Soft Circuit Breaking in Service Meshes via Application-agnostic Caching
Lars Larsson, William T\"arneberg, Cristian Klein, Maria Kihl, and, Erik Elmroth

TL;DR
This paper introduces a soft circuit breaker mechanism in service meshes that uses adaptive caching to improve resilience by returning stale data instead of errors, reducing network load and maintaining service availability.
Contribution
It proposes an application-agnostic caching-based soft circuit breaker for service meshes, enabling dynamic trade-offs between data freshness and network efficiency.
Findings
Caching reduces network load by up to 30%.
Approximately 80% of requests are served from cache.
Data staleness is maintained at about 3% or less.
Abstract
Service meshes factor out code dealing with inter-micro-service communication, such as circuit breaking. Circuit breaking actuation is currently limited to an "on/off" switch, i.e., a tripped circuit breaker will return an application-level error indicating service unavailability to the calling micro-service. This paper proposes a soft circuit breaker actuator, which returns cached data instead of an error. The overall resilience of a cloud application is improved if constituent micro-services return stale data, instead of no data at all. While caching is widely employed for serving web service traffic, its usage in inter-micro-service communication is lacking. Micro-services responses are highly dynamic, which requires carefully choosing adaptive time-to-life caching algorithms. We evaluate our approach through two experiments. First, we quantify the trade-off between traffic reduction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Cloud Computing and Resource Management · Distributed systems and fault tolerance
