Operational resilience: concepts, design and analysis
Alexander A. Ganin, Emanuele Massaro, Alexander Gutfraind, Nicolas, Steen, Jeffrey M. Keisler, Alexander Kott, Rami Mangoubi, and Igor Linkov

TL;DR
This paper introduces quantitative measures for engineering resilience applicable across various domains, demonstrating their use on models and real systems to inform design trade-offs for improved robustness.
Contribution
It proposes a unified approach to measure and analyze resilience in complex systems, applicable to physical, information, and social infrastructures.
Findings
Resilience can be optimized by adjusting redundancy and recovery parameters.
The approach is validated on models and the Linux operating system.
Nonlinear relationships between parameters and resilience are identified.
Abstract
Building resilience into today's complex infrastructures is critical to the daily functioning of society and its ability to withstand and recover from natural disasters, epidemics, and cyber-threats. This study proposes quantitative measures that implement the definition of engineering resilience advanced by the National Academy of Sciences. The approach is applicable across physical, information, and social domains. It evaluates the critical functionality, defined as a performance function of time set by the stakeholders. Critical functionality is a source of valuable information, such as the integrated system resilience over a time interval, and its robustness. The paper demonstrates the formulation on two classes of models: 1) multi-level directed acyclic graphs, and 2) interdependent coupled networks. For both models synthetic case studies are used to explore trends. For the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
