Resilient Microservices: A Systematic Review of Recovery Patterns, Strategies, and Evaluation Frameworks
Muzeeb Mohammad

TL;DR
This systematic review consolidates research on microservice recovery strategies, introducing a taxonomy and evaluation framework to standardize resilience assessment and inform fault-tolerant system design.
Contribution
It provides a comprehensive taxonomy and evaluation checklist for microservice resilience, synthesizing fragmented research into a structured, evidence-based resource.
Findings
Identified nine key resilience themes in microservice recovery.
Developed a standardized Resilience Evaluation Score checklist.
Created a decision matrix for latency, consistency, and cost trade-offs.
Abstract
Microservice based systems underpin modern distributed computing environments but remain vulnerable to partial failures, cascading timeouts, and inconsistent recovery behavior. Although numerous resilience and recovery patterns have been proposed, existing surveys are largely descriptive and lack systematic evidence synthesis or quantitative rigor. This paper presents a PRISMA aligned systematic literature review of empirical studies on microservice recovery strategies published between 2014 and 2025 across IEEE Xplore, ACM Digital Library, and Scopus. From an initial corpus of 412 records, 26 high quality studies were selected using transparent inclusion, exclusion, and quality assessment criteria. The review identifies nine recurring resilience themes encompassing circuit breakers, retries with jitter and budgets, sagas with compensation, idempotency, bulkheads, adaptive backpressure,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Software-Defined Networks and 5G
