# MORPH: An Adaptive Framework for Efficient and Byzantine Fault-Tolerant   SDN Control Plane

**Authors:** Ermin Sakic, Nemanja {\DH}eri\'c, Wolfgang Kellerer

arXiv: 1902.02560 · 2019-02-08

## TL;DR

MORPH is an adaptive framework designed to enhance SDN control plane resilience by efficiently detecting and localizing Byzantine and availability failures, reconfiguring controllers dynamically to maintain system integrity and performance.

## Contribution

It introduces a novel fault-tolerant control plane framework that handles Byzantine and availability failures, with dynamic reconfiguration and practical correctness solutions for stateful SDN controllers.

## Key findings

- Successfully tolerates up to 10 controller failures in tests
- Reduces packet and CPU load after failure detection
- Improves response time through dynamic reconfiguration

## Abstract

Current approaches to tackle the single point of failure in SDN entail a distributed operation of SDN controller instances. Their state synchronization process is reliant on the assumption of a correct decision-making in the controllers. Successful introduction of SDN in the critical infrastructure networks also requires catering to the issue of unavailable, unreliable (e.g. buggy), and malicious controller failures. We propose MORPH, a framework tolerant to unavailability and Byzantine failures, which distinguishes and localizes faulty controller instances and appropriately reconfigures the control plane. Our controller-switch connection assignment leverages the awareness of the source of failure to optimize the number of active controllers and minimize the controller and switch reconfiguration delays. The proposed re-assignment executes dynamically after each successful failure identification. We require 2FM + FA +1 controllers to tolerate FM malicious and FA availability-induced failures. After a successful detection of FM malicious controllers, MORPH reconfigures the control plane to require a single controller message to forward the system state. Moreover, we outline and present a solution to the practical correctness issues related to the statefulness of the distributed SDN controller applications, previously ignored in the literature. We base our performance analysis on a resource-aware routing application, deployed in an emulated testbed comprising up to 16 controllers and up to 34 switches, so to tolerate up to 5 unique Byzantine and additional 5 availability-induced controller failures (a total of 10 unique controller failures). We quantify and highlight the dynamic decrease in the packet and CPU load and the response time after each successful failure detection.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.02560/full.md

## Figures

29 figures with captions in the complete paper: https://tomesphere.com/paper/1902.02560/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1902.02560/full.md

---
Source: https://tomesphere.com/paper/1902.02560