Fault Injection Analytics: A Novel Approach to Discover Failure Modes in   Cloud-Computing Systems

Domenico Cotroneo; Luigi De Simone; Pietro Liguori; Roberto Natella

arXiv:2010.00331·cs.SE·October 2, 2020

Fault Injection Analytics: A Novel Approach to Discover Failure Modes in Cloud-Computing Systems

Domenico Cotroneo, Luigi De Simone, Pietro Liguori, Roberto Natella

PDF

TL;DR

This paper presents a new fault injection analytics approach using unsupervised machine learning to efficiently identify failure modes in cloud computing systems, demonstrated on OpenStack.

Contribution

It introduces a novel paradigm applying unsupervised learning to fault injection data for failure mode discovery in cloud systems.

Findings

01

Accurately identifies failure modes in cloud systems

02

Low computational cost for analysis

03

Effective on OpenStack platform

Abstract

Cloud computing systems fail in complex and unexpected ways due to unexpected combinations of events and interactions between hardware and software components. Fault injection is an effective means to bring out these failures in a controlled environment. However, fault injection experiments produce massive amounts of data, and manually analyzing these data is inefficient and error-prone, as the analyst can miss severe failure modes that are yet unknown. This paper introduces a new paradigm (fault injection analytics) that applies unsupervised machine learning on execution traces of the injected system, to ease the discovery and interpretation of failure modes. We evaluated the proposed approach in the context of fault injection experiments on the OpenStack cloud computing platform, where we show that the approach can accurately identify failure modes with a low computational cost.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.