Microreboot -- A Technique for Cheap Recovery
George Candea, Shinichi Kawamoto, Yuichi Fujiki, Greg Friedman,, Armando Fox

TL;DR
Microreboot is a fine-grain recovery technique that isolates faulty application components for rapid, cost-effective recovery, reducing downtime and enabling more resilient, high-availability Internet systems.
Contribution
This paper introduces microrebooting, a novel approach that separates process recovery from data recovery for faster, cheaper fault recovery in large-scale Internet applications.
Findings
Microreboots recover most failures faster than full reboots.
Microreboots reduce recovery time and work loss by an order of magnitude.
Microrebooting enables rejuvenation without system shutdown.
Abstract
A significant fraction of software failures in large-scale Internet systems are cured by rebooting, even when the exact failure causes are unknown. However, rebooting can be expensive, causing nontrivial service disruption or downtime even when clusters and failover are employed. In this work we separate process recovery from data recovery to enable microrebooting -- a fine-grain technique for surgically recovering faulty application components, without disturbing the rest of the application. We evaluate microrebooting in an Internet auction system running on an application server. Microreboots recover most of the same failures as full reboots, but do so an order of magnitude faster and result in an order of magnitude savings in lost work. This cheap form of recovery engenders a new approach to high availability: microreboots can be employed at the slightest hint of failure, prior to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOrthopedic Surgery and Rehabilitation · Reconstructive Surgery and Microvascular Techniques
