Identifying Operational Data-paths in Software Defined Networking Driven   Data-planes

Jos\'e Reyes; Jorge L\'opez; and Djamal Zeghlache

arXiv:1906.03144·cs.NI·June 10, 2019

Identifying Operational Data-paths in Software Defined Networking Driven Data-planes

Jos\'e Reyes, Jorge L\'opez, and Djamal Zeghlache

PDF

Open Access

TL;DR

This paper presents a method using distributed traffic generation and monitoring to identify and verify the actual operational data-paths in SDN data-planes, ensuring network reliability and security.

Contribution

It introduces a formal approach and toolkit for discovering operational data-paths in SDN, addressing inconsistencies and potential security issues.

Findings

01

Successfully identified operational data-paths in SDN environments

02

Revealed inconsistencies in existing SDN applications

03

Provided a toolkit for data-path verification

Abstract

In this paper, we propose an approach that relies on distributed traffic generation and monitoring to identify the operational data-paths in a given Software Defined Networking (SDN) driven data-plane. We show that under certain assumptions, there exist necessary and sufficient conditions for formally guaranteeing that all operational data-paths are discovered using our approach. In order to provide reliable communication within the SDN driven data-planes, assuring that the implemented data-paths are the requested (and expected) ones is necessary. This requires discovering the actual operational (running) data-paths in the data-plane. In SDN, different applications may configure different coexisting data-paths, the resulting data-paths a specific network flow traverses may not be the intended ones. Furthermore, the SDN components may be defected or compromised. We focus on discovering…

Equations7

I (e) =

I (e) =

H (p) = k x_{1, 1} x_{1, 2} \dots x_{1, k_{1}} k_{1} x_{2, 1} x_{2, 2} \dots x_{2, k_{2}} k_{2} \dots x_{n, 1} x_{n, 2} \dots x_{n, k_{n}} k_{n}

H (p) = k x_{1, 1} x_{1, 2} \dots x_{1, k_{1}} k_{1} x_{2, 1} x_{2, 2} \dots x_{2, k_{2}} k_{2} \dots x_{n, 1} x_{n, 2} \dots x_{n, k_{n}} k_{n}

\overline{x_{33}} \land \overline{x_{34}} \land \overline{x_{35}} \land \overline{x_{36}} \land \overline{x_{37}} \land \overline{x_{38}} \land \overline{x_{39}} \land \overline{x_{40}}

\overline{x_{33}} \land \overline{x_{34}} \land \overline{x_{35}} \land \overline{x_{36}} \land \overline{x_{37}} \land \overline{x_{38}} \land \overline{x_{39}} \land \overline{x_{40}}

\land \overline{x_{41}} \land x_{42} \land \overline{x_{43}} \land x_{44} \land \overline{x_{45}} \land \overline{x_{46}} \land \overline{x_{47}} \land \overline{x_{48}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Software System Performance and Reliability · Internet Traffic Analysis and Secure E-voting

Full text

Identifying Operational Data-paths in Software Defined Networking Driven Data-planes

José Reyes, Jorge López, and Djamal Zeghlache

SAMOVAR, CNRS

Télécom SudParis / Université Paris-Saclay

9 Rue Charles Fourier, 91000 Évry, France

Email: {jose.reyes,jorge.lopez,djamal.zeghlache}@telecom-sudparis.eu

Abstract

In this paper, we propose an approach that relies on distributed traffic generation and monitoring to identify the operational data-paths in a given Software Defined Networking (SDN) driven data-plane. We show that under certain assumptions, there exist necessary and sufficient conditions for formally guaranteeing that all operational data-paths are discovered using our approach. In order to provide reliable communication within the SDN driven data-planes, assuring that the implemented data-paths are the requested (and expected) ones is necessary. This requires discovering the actual operational (running) data-paths in the data-plane. In SDN, different applications may configure different coexisting data-paths, the resulting data-paths a specific network flow traverses may not be the intended ones. Furthermore, the SDN components may be defected or compromised. We focus on discovering the operational data-paths on SDN driven data-planes. However, the proposed approach is applicable to any data-plane where the operational data-paths must be verified and / or certified. A data-path discovery toolkit has been implemented. We describe the corresponding set of tools, and showcase the obtained experimental results that reveal inconsistencies in well-known SDN applications.

Index Terms:

Software Defined Networking; Data-plane analysis; Distributed test case generation; Run-time monitoring;

I Introduction

Novel technologies allow flexible and fast network (re-)configuration of homogeneous network devices. Particularly, Software Defined Networking (SDN) [1] allows to centrally configure all data-plane (forwarding) devices; the data-plane devices (e.g., SDN-enabled switches) are configured by so-called SDN applications through the SDN controller. Some of the advantages of SDN are: (i) heterogeneous hardware can be managed with a single vendor-agnostic configuration interface; (ii) central configuration eases the management and reduces its execution time; and (iii) it avoids manual, error-prone configurations of the data-plane devices. For those reasons, SDN networks have evolved from small prototype networks to provider-scale network deployments [2], and their popularity constantly increases. Thus, guaranteeing the correct functional and non-functional behavior of such systems is crucial [3, 4].

Data-plane devices are configured with flow rules, that dictate the actions to perform once receiving the network packets. In fact, the highest priority rule that matches a given packet is used to determine the action to take (e.g., drop, forward, etc.). However, as SDN networks are highly dynamic (forwarding devices can be frequently re-configured, and furthermore, by different applications), verifying that the packets follow the correct (intended) data-paths is of special interest. Statically analyzing the rules installed in the data-plane is a common approach [5]. However, by employing this approach it may be impossible to retrieve the operational data-paths configured in a given data-plane. For instance, when all network packets are sent to the SDN controller to query the action to take (and subsequently an SDN application decides on the appropriate action to perform, see Section II-A for background concepts on SDN).

The data-paths installed in a data-plane must be correct with respect to a number of functional and non-functional properties. For instance, from the functional standpoint the installed data-paths should coincide with the requested ones [3]. From the non-functional standpoint (and particularly security), ensuring that there are no additional data-paths from the the intended ones is important to protect data secrecy; similarly, detecting fewer data-paths may be an indicator of a denial of service. In order to provide reliable communications within the SDN data-plane, it is necessary to verify that the data-paths configured in the data-plane are correct. However, to verify the data-paths configured at a given data-plane it is required to retrieve (discover) the data-paths that are actually implemented in the data-plane. An immediate question follows: how can the data-paths implemented in a given SDN data-plane be discovered? Further, can it be (formally) guaranteed? This paper is devoted to reply to the previously stated questions, particularly, we focus on providing a formal methodology for retrieving the actual data-paths configured in the data-plane. We assume that there are no restrictions on the access for any Point of Control (PC) nor for any Point of Observation (PO), i.e., we assume we can stimulate the SDN network at any data-generation point, and likewise, that we can observe the network’s reaction at any point.

In the existing literature, there are few works that address the stated problem. Moreover, to the best of our knowledge no formal methods guaranteeing the data-path discovery have been presented (for more information see Section V). For that reason, we propose a distributed traffic generation and monitoring approach; different network packets are generated at selected nodes (hosts) of the data-plane, and then by monitoring the interfaces of the data forwarding devices, the traffic graph (or data-path) is retrieved. To guarantee all the data-paths installed in the data-plane are identified, we prove the conditions when the execution of a test suite (a set of test cases, i.e., network packets to be generated at given PCs) is necessary and sufficient to observe all implemented data-paths (Section III). Using the proposed approach, a set of tools for data-path discovery has been implemented (Section IV). We showcase the experimental results obtained by employing the developed tool. Particularly, we show how a wide-spread SDN application forwards data in an inconsistent and ineffective manner. Further, the developed tools may be used in different application areas. For example, guaranteeing the installed data-paths conform to the requested ones; guaranteeing that the time to traverse a data-path is good (performance-wise); guaranteeing that there are no security faults in the data-plane; etc.

II Background

II-A Software Defined Networking

In traditional networks, the configuration, management, and data-forwarding interfaces are distributed / located at each of the data forwarding devices in the data-plane. The data-paths (the paths network packets follow in a data-plane) in the network are the result of the configuration on each of the forwarding devices; each of the devices has a local configuration and management interface. Thus, in order to re-configure the data-paths, several devices must be re-configured; as a consequence, while re-configuring each device the network may be in an inconsistent state, the process can be error-prone and slow. As an example, assume a data-plane in a traditional network as (only the data-plane) shown in Figure 1. Assume there is an issue with the link between the switches $s2$ and $s3$ in the data-plane. The data-path depicted in dashed arrows ( $h2\rightarrow s2\rightarrow s3\rightarrow h3$ ) becomes not operational. In order to re-configure this data-path, for example to $h2\rightarrow s2\rightarrow s4\rightarrow s3\rightarrow h3$ , the switches $s2,s3$ , and $s4$ must be re-configured, independently.

SDN overcomes these limitations by separating the control and the data-plane layers [1]. With a centralized SDN controller, SDN applications can automatically re-configure the SDN data-plane in a timely manner. Furthermore, the devices in the data-plane may have different protocols and interfaces (called southbound interface), while the controller has a single communication protocol (northbound interface) with the applications; thus, simplifying communication with heterogeneous and vendor-agnostic data-planes. Finally, SDN-enabled forwarding devices stir (route / forward) the incoming network packets based on so-called flow rules installed by the SDN applications (through the controller). A flow rule consists of three main (functional) parts: a packet matching part, an action part and a location / priority part. The matching part describes the values which a received network packet should have for a given rule to be applied. The action part states the required operations to perform to the matched network packets, while the location / priority part controls the hierarchy of the rules using tables and priorities. Finally, it is important to note that there exists a special output port for a flow rule, the controller port; when a packet is sent to the controller, the controller queries the SDN applications to decide the actions to perform to the packet; as a result, the controller may install new flow rules, drop or forward the packet to a specific port.

II-B System testing

Traditionally, system testing is conceived as a procedure to guarantee that a given System Under Test (SUT) is functionally and / or non-functionally correct. In this paper, we do not focus on guaranteeing correctness (we focus on providing means for guaranteeing such correctness), however, many of the concepts used in our approach are based on system testing and monitoring concepts.

Generally speaking, a given SUT $\mathcal{S}$ has an alphabet of input and output (observable reactions) symbols, $I$ and $O$ , respectively. When $\mathcal{S}$ is stimulated with a sequence of inputs $\alpha\in I^{*}$ , it produces a sequence of outputs $\beta\in O^{*}$ . To guarantee the correct behavior of $\mathcal{S}$ , formal / model-based testing for reactive systems is widely adopted. A test suite $TS$ is a set of input (test) sequences, sometimes a test suite can contain the expected output reactions. Additionally, a Point of Control (PC) is an interface where $\mathcal{S}$ can be stimulated (inputs can be generated for the system), and a Point of Observation (PO), is an interface where the output reactions of $\mathcal{S}$ can be observed. Notice that we consider the distributed nature by stimulating $S$ at several PCs, and furthermore collecting the information from different POs. Nonetheless, the information is collected and processed at a centralized analyzer [6].

III Analyzing SDN data-planes

III-A Basic concepts

This section presents the main contribution of our work. First, we introduce the necessary definitions and assumptions, and later we prove the necessary and sufficient conditions when all the data-paths in a data-plane are discovered. In this paper, we consider the data-plane is the SUT (where existing data-paths need to be retrieved for later verification), and accordingly we make the following assumptions regarding its structure and functional aspects.

The data-plane is given. We note that SDN controllers allow the automatic retrieval of the data-plane through the link layer discovery protocol [7] or others. However, we assume that the discovery process is more reliable if it does not depend on information provided by the SDN network, i.e., the SUT. One of the reasons is that the discovery procedure may be used to estimate the correctness of the SUT [3, 4] (see Section V for related work). 2. 2.

In the data-plane, there are data generation / reception devices, i.e., hosts, and data forwarding devices (e.g., switches); hosts do not forward traffic, and forwarding devices do not generate / receive data. Operating systems of networking devices may allow to violate this assumption, however, a networking device can be considered as two different ones in our model if this is the intended functional behavior. Additionally, the data-plane must have at least two hosts and one forwarding device; which makes sense from the functional point of view. 3. 3.

In the data-plane, devices are connected, however, no two hosts are connected to each other, and data forwarding devices have at least two connections. Furthermore, each pair of devices is connected though a single port, sharing a single link between them. These assumptions are reasonable for networking infrastructures, hosts are connected to a single access forwarding device, and a forwarding device with a single connection cannot forward data. Likewise, when two devices are connected with many links, the link is usually bonded and considered a single link with higher bandwidth. Finally, we assume that the communication between each pair of nodes is bidirectional. 4. 4.

In the data-plane, (network) packets generated by hosts are routed (stirred) by the forwarding devices. The forwarding devices can either drop a packet (not forward it) or it can be forwarded to a set of output ports (and correspondingly neighboring devices). Furthermore, we assume the decision is taken only depending on the input port and network packet headers [8]. Additionally, we assume the links are 100% reliable (we do not consider packet loss), and the packets are not altered by the forwarding devices (although our approach may work independently from this assumption). 5. 5.

In the data-plane, traffic can be generated at any of the hosts. Likewise, traffic can be observed at all the network interfaces in the data-plane. We assume there is no restriction of access. This is a reasonable assumption if the discovery procedure is part of a certification process. 6. 6.

Finally, we assume that the configuration in the data-plane is not modified while the discovery procedure is being executed. This may sound as a strong assumption given the dynamic nature of SDN networks. However, this scenario is feasible for certain cases, for example for a certification process. Moreover, the packets generated for testing purposes are distinguishable from data passing though the data-plane.

Given the previously stated assumptions, we consider the data-plane is given in our approach using the following definition.

Definition 1.

A data-plane $D$ is a weighted graph $(V,E,\mathcal{I})$ , where $V$ is the set of nodes, $E$ is the set of undirected edges (unordered pairs of nodes), and $\mathcal{I}$ is the interface function, that maps an edge to a pair of pairs of nodes and interfaces (the port numbers at each node), i.e., $\mathcal{I}:E\mapsto(V\times\mathds{N})^{2}$ (we consider the set of natural numbers denoted by $\mathds{N}$ as the set of non-negative integers). Additionally, $D$ has the following properties:

•

The set of vertices is the union of two subsets, the set of hosts $H\subset V$ and the set of forwarding devices $S\subset V$ , which are disjoint, and their cardinalities are restricted in the following manner: $2\leq|H|<|V|$ , $1\leq|S|<|V|$ , and $|H|+|S|=|V|$ .

•

The connectivity of the graph holds the following properties: (i) $\forall h\in H\;deg(h)=1$ ; (ii) $\forall s\in S\;deg(s)\geq 2$ , where $deg$ is the degree function, as usual; and (iii) $\forall e=\{v_{a},v_{b}\}\in E\;(v_{a}\in H\implies v_{b}\not\in H)\wedge(v_{b}\in H\implies v_{a}\not\in H)$ .

As an example, consider the data-plane shown in Figure 1, this data-plane is represented by the graph $D$ in Figure 2 with $H=\{h_{1},h_{2},h_{3},h_{4}\}$ , $S=\{s_{1},s_{2},s_{3},s_{4}\}$ , $E=\{\{h_{1},s_{1}\},\{h_{2},s_{2}\},\{h_{3},s_{3}\},\{h_{4},s_{4}\},\{s_{1},s_{2}\},\{s_{1},s_{3}\},\{s_{2},s_{3}\},\{s_{2},s_{4}\},\{s_{3},s_{4}\}\}$ , and $\mathcal{I}$ defined as follows:

[TABLE]

According to our assumptions, and particularly with the node type and traffic assumptions (Assumption 2 and 4, respectively), messages being sent from a host $h\in H$ are routed through different nodes of $S$ , and the decision where to route is deterministic, and is based on the values within the message (header) and the interface (or input port) where the message is received. For that reason, we define the concept of traffic type. First, we consider that a network packet $p$ is a binary string ( $p\in\{0,1\}^{*}$ ), and thus so its header. A packet header has a predefined set of parameters. Therefore, we consider a packet header as a Boolean vector, denoted $\mathcal{H}(p)$ . Consider the header with $n$ (relevant) parameters of total length $k=\sum_{i=1}^{n}k_{i}$ :

[TABLE]

In the OpenFlow (OF) protocol (a widespread and popular protocol for SDN-enabled switches) specification [8], there exists a minimum set of header parameters (and corresponding lengths) upon which the traffic can be matched (and later routed), for example, destination and source IP address, destination and source TCP port, etc. Therefore, forwarding devices route packets depending on the values of the Boolean vector $\mathcal{H}(p)$ . In general, the matching part of a forwarding rule has the form “ $parameter\;1=value\;1\wedge parameter\;2=value\;2\wedge\ldots\wedge parameter\;n=value\;n$ ”; as an example, “destination IP address = 10.0.0.1 $\wedge$ destination TCP port = 80”. Informally, we consider that the traffic type is a set of network packets whose header match particular values. Formally, we consider:

Definition 2.

A traffic type indicator $1_{\tau}$ is a Boolean function (characteristic function) for a given packet header $\mathcal{H}(p)\in\{0,1\}^{k}$ such that $1_{\tau}$ equals $1$ when $p$ belongs to the traffic type $\tau$ .

As an example, consider the traffic type $\tau=$ “packets with header matching destination TCP port = 80”, assume the only two relevant parameters are destination IP address of length 32, and destination TCP port of length 16 (the corresponding Boolean vector has length 48). Considering that the first 32 inputs correspond to the destination IP address, and the next 16 to the destination TCP port, $1_{\tau}$ can be expressed as:

[TABLE]

A packet $p$ is of type $\tau$ if $1_{\tau}(\mathcal{H}(p))=1$ ; for convenience, we denote it as $p_{\tau}$ . Further, the traffic of type $\tau$ is the set $\{p\in\{0,1\}^{*}|1_{\tau}(\mathcal{H}(p))=1\}$ . It is important to note that a packet of type $\tau$ may be routed differently from a packet of type $\tau^{\prime}$ , coming from the same predecessor (node). That implies that the paths taken by a given packet $p$ in a data-plane $D$ depend on its traffic type. Likewise, packets of the same packet type may be routed differently, when they arrive from different predecessors; this recursively applies, until the origin of the packet (a host in the data-plane). When data are sent (encapsulated in network packets) by hosts in the data-plane (see assumption 2), depending on the configuration of the forwarding devices, the data follow a specific data-path.

Definition 3.

A data-path for a packet $p$ with header $\mathcal{H}(p)$ in a data-plane $D=(H\cup S,E,\mathcal{I})$ starting at host $h\in H$ , denoted $\pi_{D}(h,\mathcal{H}(p))$ is a non-empty (potentially infinite) sequence of directed edges that represents a path that a network packet with a specific header follows in the data-plane, for a given host and traffic type. The $j$ -th edge in the path is denoted by $\pi_{D}^{j}(h,\mathcal{H}(p))$ ; we assume the first edge has index 1, and that the length of a data-path is the number of edges on it, denoted as $|\pi_{D}(h,\mathcal{H}(p))|$ . We denote the set all data-paths for a packet $p$ with header $\mathcal{H}(p)$ starting at host $h$ as $\Pi_{D}(h,\mathcal{H}(p))$ . Similarly, $\Pi_{D}(\mathcal{H}(p))$ the set of all data-paths for a packet $p$ with header $\mathcal{H}(p)$ ; the set of all data-paths of type $\tau$ in a data-plane is denoted as $\Pi_{D}(\tau)$ ; and the set of all data-paths in a data-plane is denoted as $\Pi_{D}$ . We note that $\pi_{D}(h,\mathcal{H}(p))$ holds the following properties:

•

$\forall j\in\{1,\ldots,|\pi_{D}(h,\mathcal{H}(p))|\}\;\pi_{D}^{j}(h,\mathcal{H}(p))=(v_{a},v_{b})\implies\{v_{a},v_{b}\}\in E$ , the directed edges are part of $D$ ;

•

$\forall j\in\{1,\ldots,|\pi_{D}(h,\mathcal{H}(p))|-1\}\;\pi_{D}^{j}(h,\mathcal{H}(p))=(v_{a},v_{b})\implies\pi_{D}^{j+1}(h,\mathcal{H}(p))=(v_{b},v_{c})$ , the sequence of edges connects vertices in $D$ ;

•

$\pi_{D}^{1}(h,\mathcal{H}(p))=(h,s_{h})$ , data-paths start at hosts.

As an example, in Figure 1 we present two different data-paths $(h_{1},s_{1})(s_{1},s_{2})(s_{2},h_{2})$ (depicted with solid arrows), and $(h_{2},s_{2})(s_{2},s_{3})(s_{3},h_{3})$ (depicted with dashed arrows). It is important to note that a packet $p$ generated at host $h$ can induce data across a set of paths, the reason is that switches can clone packets (forwarding traffic to a set of output ports, see assumption 4). However, a set of data-paths for a given packet has common prefixes, for example, the set of paths $\{(h_{1},s_{1})(s_{1},s_{2})(s_{1},s_{3})(s_{3},h_{3}),(h_{1},s_{1})(s_{1},s_{2})(s_{2},h_{2})\}$ , reflects the fact that at $s_{1}$ when receiving a packet with header $\mathcal{H}(p)$ from $h_{1}$ the switch outputs the packet to $s_{2}$ and $s_{3}$ simultaneously. In a data-path a repeated edge implies the existence of a loop, and evidently, the data-path is of infinite length.

III-B Identifying operational data-paths

In this section, we present a distributed test case generation / execution, and monitoring approach to discover the installed data-paths in a data-plane. We first give an overview of the proposed approach, and then proceed to the formal definitions and propositions that guarantee all (operational) data-paths are discovered using the presented approach.

In order to discover the implemented data-paths on a data-plane, we propose to stimulate the SUT (the data-plane) at different points with network packets, and to monitor at different network interfaces in the data-plane, in order to observe the existing data-paths. We propose to install points of control at all hosts’ interfaces, and points of observation at all forwarding devices’ interfaces. As an example, in Figure 1 we illustrate the PCs as (red) diamonds, and the POs as (green) circles. We consider a distributed test case a specific PC (host) and a packet header; executing a test case in a data-plane is to send a packet with the given headers from the given (host) PC; as hosts have a single interface (see assumption 3), there is a single interface where to send the network packet. After a test case is executed against an SUT, the packet is forwarded and while traversing a network interface, a monitor installed at the interface in question sends the observed packets to a centralized processing service and the traces are analyzed. We formally define:

Definition 4.

A distributed test case for a data-plane $D=(H\cup S,E,\mathcal{I})$ , denoted as $tc_{D}$ is a pair $(h,\mathcal{H}(p))$ , where $h\in H$ , and $\mathcal{H}(p)$ is a network packet header. Correspondingly, a test suite $TS_{D}$ is a set of test cases for a given data-plane. Executing a test case $tc_{D}=(h,\mathcal{H}(p))$ means to generate $p$ (with header $\mathcal{H}(p)$ ) at host $h$ .

We note that given the traffic assumption (assumption 4), we consider that the links are 100% reliable, and therefore, the test suites contain only distinct distributed test cases. Considering the reliability of a link in terms of probability is out of the scope of this paper and left for future work (see Section VI). As discussed before, after executing a distributed test case on a data-plane, the generated network packet is observed though the monitoring interfaces. Formally, we define:

Definition 5.

An observation $\omega_{tc}$ after executing a test case $tc_{D}$ in a data-plane $D=(H\cup S,E,\mathcal{I})$ is a triple $(s,i,t)$ , where $s\in S$ , $i$ is the interface at the node $s$ where the packet is observed, and $t$ is a time-stamp, when the observation occurred. Correspondingly, the set of all observations after executing a given test case $tc_{D}$ is denoted as $\Omega_{tc}$ .

As an example, consider the data-paths as depicted in Figure 1 (correspondingly, the data-plane as shown in Figure 2). Assume the path $(h_{1},s_{1})(s_{1},s_{2})(s_{2},h_{2})$ (depicted with solid arrows) is configured for packets with header matching destination TCP port = 80. Executing a distributed test case $tc=(h_{1},\mathcal{H}(p))$ (with header matching destination TCP port 80) produces the set of observations111For easiness and readability we use simple integers for the time-stamps. $\Omega_{tc}=\{(s_{1},1,1),(s_{1},2,2),(s_{2},2,3),(s_{2},1,4)\}$ . It is important to note that in this paper we assume all nodes in the data-plane are synchronized, and therefore, the time-stamps obtained at different nodes are comparable. Considering architectures with different clocks is out of the scope of the paper, and also left for future work. After understanding the relation between test cases and observations, an immediate question arises: can the set of observations be used to discover data-paths? For that reason, we present the following statement.

Proposition 1.

For any set of observations $\Omega_{tc}$ , after executing a test case $tc_{D}=(h,\mathcal{H}(p))$ in a data-plane $D=(H\cup S,E,\mathcal{I})$ , the set of data-paths $\Pi_{D}(h,\mathcal{H}(p))$ can be computed.

Proof.

By construction (in Algorithm 1), we show that the set of observations can be used to compute a set of data-paths. Further, we note that by definition (Definition 3) the set of data-paths that corresponds to the execution of $tc_{D}=(h,\mathcal{H}(p))$ is $\Pi_{D}(h,\mathcal{H}(p))$ . ∎

Note that Algorithm 1 creates a tree data structure (referred throughout the paper as a flow tree), in order to obtain the data-paths from the observations. Likewise, note that in the tree each of the nodes has a time where the message entered the node, the Time of Ingress (TI) and one Time of Egress (TE) per child, which denotes the time at which a message left the given node. Later, the set of paths is constructed using a simple depth-first search. To give a better intuition of how Algorithm 1 works, consider a test case $tc_{D}=(h,\mathcal{H}(p))$ , and a set of observations $\Omega_{tc}=\{(s_{1},1,1),(s_{1},2,2),(s_{1},3,3),(s_{2},2,4),(s_{3},2,5),(s_{2},1,6),(s_{3},1,7)\}$ ; some of the stages of the flow tree construction are depicted in Figure 3.

Algorithm 1 always terminates. There are three possibilities of how it terminates. First, if there is an observation that occurs at a given time, and in the flow tree there is no neighboring node with a smaller corresponding time, then Algorithm 1 does not attach it to the tree and returns an error; the reason is that a data-path must be connected (according to its definition), and in our assumptions all interfaces are monitored. Identifying a disconnected data-path may imply that a P.O. in a preceding forwarding device does not report the packet to the processing server. This may be an indication of an attack, however, determining such issues is out of the scope of this paper. The second case occurs when a given observation occurs at a node that has been previously traversed, and that eventually produces an infinite loop. Although a data-path can be infinite (and theoretically observations too), from the practical standpoint it serves no purpose to report such data-paths. Therefore, when a loop is detected Algorithm 1 terminates with a loop error. Finally, for any set of observations in which all its observations have a connecting neighbor with an appropriate chronological order, and in which no loops occur, Algorithm 1 appends a node in a tree data structure, at the corresponding level. Ultimately, the flow tree is transformed to a set of edge sequences. Such edge sequences have the following properties: i) they belong to $E$ as the algorithm queries the corresponding edges in $D$ before adding each node; ii) they are connected, as otherwise the algorithm returns an error; and, finally (iii) they start at host $h$ as the root of flow tree is set to it at Step 2. The previous properties held by the returned elements of Algorithm 1 guarantee that the returned sequences of edges are data-paths by definition. Thus, guaranteeing the correctness of the algorithm. Therefore, the following statement holds.

Proposition 2.

Algorithm 1 always terminates, and a set of data-paths $\Pi$ is given for a set of observations $\Omega$ that corresponds to a finite and connected network packet traversal at a data-plane $D$ , starting at host $h$ (in $D$ ).

Now we turn our attention to constructing the distributed test cases (and eventually the test suites) that guarantee the observation (discovery) of data-paths. Given the Assumption 4, Proposition 1 implies the following statement:

Corollary 1.

A set of data-paths $\Pi_{D}(h,\mathcal{H}(p))$ in a data-plane $D=(H\cup S,E,\mathcal{I})$ is observed iff the distributed test case $tc_{D}=(h,\mathcal{H}(p))$ is executed.

An immediate question follows: is there any distributed test suite that guarantees discovering all data-paths of a given type? Further, is there any distributed test suite that guarantees discovering all data-paths in a data-plane? To reply to these questions, let us first consider how to discover all data-paths for a given traffic type. According to our assumptions (particularly Assumption 2), only hosts generate traffic, and therefore data-paths can only start at hosts (see Definition 3). Thus, similarly to Corollary 1, provided that all data-paths of type $\tau$ are observed then a test suite $TS_{D}=\{(h,\mathcal{H}(p))|\mathcal{H}(p)\in\{0,1\}^{k}\wedge 1_{\tau}(\mathcal{H}(p))=1\wedge h\in H\}$ , containing a test case for each host in $D$ and for each packet header of type $\tau$ has been executed. Likewise, executing such a test suite guarantees observing all data-paths of type $\tau$ . Therefore, the following statement holds.

Proposition 3.

All data-paths in the data-plane $D=(H\cup S,E,\mathcal{I})$ (the set $\Pi_{D}$ ) are observed iff the distributed test suite $TS_{D}=\{(h,\mathcal{H}(p))|\mathcal{H}(p)\in\{0,1\}^{k}\wedge h\in H\}$ is executed.

As usual, it is interesting to estimate the length of the distributed test suites which guarantee the discovery of data-paths. We note that the distributed test suite that guarantees discovering all paths of a particular traffic header is a union of all data-paths with that header, and therefore its length is $|H|$ . The length of the test suite that guarantees discovering all data-paths of a given type of traffic $\tau$ highly depends on the form of the traffic type indicator functions. If restricting the traffic type indicator functions only to conjunctions in the form $parameter\;i=value\;i$ , the length of the test suite of interest is $2^{k-k^{\prime}}|H|$ , where $k^{\prime}$ is the length of the parameters involved in $1_{\tau}$ . Likewise, the distributed test suite that guarantees discovering all data-plane is $2^{k}|H|$ . This result sounds unpromising, considering that the length of the parameters involved (the relevant header parameters) are in the order of hundreds. Nonetheless, discovering the data-paths is often focused on interesting traffic, i.e., on particular headers or traffic types. In our implementation (see Section IV), we focus also on interesting traffic.

An interesting detail for implementations is the calculation of the maximal number of packets a monitoring system should collect after executing a test case. Perhaps the most interesting is the estimation of an upper bound for the cardinality of a set of data-paths. With this upper bound, the monitoring systems can decide when to stop the monitoring process, after the collection of a number of packets there is a guarantee there exists a loop in the SDN configuration (and therefore no need to continue monitoring).

Proposition 4.

The number of finite data-paths $\Pi_{D}(h,\mathcal{H}(p))$ in a data-plane $D=(V,E,\mathcal{I})$ does not exceed $(|V|-1)^{|V|(|V|-1)}$ , and the length of each path does not exceed $|V|(|V|-1)$ .

Proof.

A data-path without loops has no repeated edges, otherwise, the path is infinitely repeated. Thus, there exists at most $|V|(|V|-1)$ distinct edges, and therefore, a longest branch of a path has at most $|V|(|V|-1)+1$ nodes. Similarly, a node has at most $|V|-1$ edges. Therefore, if cloning at each node, there are $(|V|-1)^{l}$ nodes, where $l$ is the current length of the data-path (starting at [math]). Thus, the number of data-paths is at most $(|V|-1)^{|V|(|V|-1)}$ , and their maximal length is $|V|(|V|-1)$ . ∎

We note that studying the reachability of these upper bounds is an interesting question by itself. However, studying tighter bounds is left for future work.

IV Experimental results

In this section, we discuss the developed set of tools implementing the proposed methods, the experimental setup and the obtained experimental results.

Tools description

As previously discussed, our approach is based on distributed test case generation and monitoring. Five different tools (or modules) were developed, those are: (i) the extraction tool - a network packet sniffer which is installed at all network interfaces of the data-plane forwarding devices (POs), that forwards the traffic to the analyzer tool; (ii) the packet generation tool - a raw packet generator which is installed at all hosts in the data-plane, that generates packets with specific headers (in the widespread pcap-filter syntax [9]) and a Unique Identifier (UID) as the payload of the packet (to distinguish the test packets); (iii) the analyzer tool - a tool installed at the processing server that receives the packets, filters the packets having the UID and computes the flow trees from the set of observations; (iv) the orchestration tool - receives the traffic of interest to generate, requests the packet generation tool to send a network packet with the requested headers using a specific UID; and finally (v) the User Interface (UI) tool - a web interface for discovering the data-paths that receives the interesting traffic to discover from the user and reports the resulting data-paths (in the form of a flow tree). In Figure 4, we show how the tools interact between each other. The process starts with a user input, asking the UI to generate a packet with a specific header. Then, the UI forwards the request to the orchestrator and based on that header the orchestrator sends the data-plane model to the analyzer, the packets to filter to the extractions agents, and the packets to generate to the generation agents. The generation tool generates the appropriate packet and those packets are sent to the forwarding devices, and eventually the extraction tool captures, and forwards them to the analyzer. The analyzer computes the corresponding paths, and sends the information back directly to the UI, which displays the flow tree.

The tools have been developed in different programming languages, including Golang, Python and Javascript. For further details of the implementations, please visit the official repository222https://github.com/letitbeat/data-path-discovery.. Likewise, the interested reader can see our short data-path discovery demo333https://vimeo.com/307046352..

Experimental setup

In order to perform our experiments, different data-planes have been simulated using the well-known Mininet simulator, and more precisely, Containernet [10] has been used, in order to use docker containers with pre-installed tools. Additionally, the ONOS controller was used for all the experiments. In order to validate our toolset, experiments with installed flow rules were performed. Different data-paths have been configured for different topologies; the flow rules (or intents) were added through the ONOS REST API, the ONOS command line interface or its web (graphical) interface. Our tools are capable of successfully discovering the data-paths of interest (as configured) 100% of the time. Further, the observed data-paths coincide with the requested ones (in exception of data-paths containing known bugs for the architecture [11]). An example can be seen in our data-path discovery video demonstration.

Experiments with the ONOS Reactive Forwarding Application

Perhaps the most valuable contribution of our approach is to discover what real applications implement as SDN data-paths. To that end, a well-known SDN application has been installed, namely the ONOS Reactive Forwarding (ORF) application; Reactive forwarding “refers to the mechanism used to install forwarding entries into the network switches - those entries are installed on-demand after a sender starts transmitting packets.” [12]. The application has been installed in our experimental setup, the hosts in the data-plane successfully communicate between them. However, after the discovery process has been executed an unexpected result has been encountered. Indeed, the application does not compute a data-path to forward the network traffic; the application floods the network, i.e., forwarding devices send a received packet to all its neighbors in the data-plane, in exception of the sending device. As an example, consider the data-plane shown in Figure 2, a message (with header matching destination TCP port = 22) from $h_{1}$ to $h_{2}$ follows the set of data-paths shown in (as a flow tree) Figure LABEL:fig:reactive_observed while the expected is to have a single data-path, a possibility is shown in Figure LABEL:fig:reactive_expected. The previous behavior may be a functional error in the application or a miss-configuration, and in this paper, we do not focus on the cause, however, we highlight the value of the proposed approach in discovering the operational data-paths. The operational data-paths as configured by the ORF application may violate data secrecy, overload the network with unnecessary network packets, and may not conform to the reactive forward specification; both functional and non-functional errors may be detected using our proposed tools and methods.

Performance Evaluation

Although the performance of proposed approach could be evaluated theoretically, performing experiments on our experimental setup can reveal real processing times. In general, after executing a distributed test case, the time required to discover the data-paths is greatly influenced by: (i) the collection of observations and, (ii) the algorithm’s (Algorithm 1) processing time. In theory, the required time to collect the observations should be a linear function that depends of the length of a largest data-path. This under the assumption that an observation arrives to the centralized monitor as the network packet traverses the data-plane. Further, the delay between the forwarding devices has the largest influence on processing time. Similarly, the time the algorithm takes to process the set of observations should be a polynomial function that depends on the number of edges in a data-path and the number of data-paths itself (see Proposition 4).

In order to estimate the execution time of both the data collection and the algorithm processing different topologies have been configured and likewise, different data-paths with different lengths have been configured (up to big data-paths of length $75$ ). As can be seen in Figure 6, the time required to discover data-paths is mostly influenced by the data-collection (and the packet traversal itself in the data-plane). The previous results show that our approach may be applicable, even for identifying a somewhat large number of operational data-paths. Note that the the processing time of the algorithm is negligible in comparison with the data collection time penalty, and therefore, its increase is barely noticeable in the reported plot.

V Related work

The approaches presented in [3, 4] aim at performing model-based testing of the whole SDN framework using active testing techniques. After executing the test cases, the authors propose an approach to conclude about the implemented data-paths that inspired this work; their approach is based on specific traffic generation. However, the authors consider only pairwise ICMP echo request/reply traffic generation. Thus, it only discovers the pairwise reachability between hosts, and no other type of data-paths.

In [13], Handigol et al. introduce a network debugger tool to help identifying the root cause of network errors and particularly SDN related by showing the sequence of events which lead to aforementioned errors. We believe such tools are extremely important, nonetheless, their tool cannot be directly applied to our problem statement. Likewise, other methods focused on detecting inconsistencies are the ones presented in [14, 15, 16, 17].

SDN Traceroute [18] is a tool to trace the path a given packet follows; their approach relies on the prior identification of forwarding devices, then to install a set of rules at each of these devices, in order to forward to the controller the probes sent by the tool. The authors do not focus on providing formal guarantees for discovering all operational data-paths. Additionally, our approach does not install additional forwarding rules (or modifies the packet headers) in order to achieve the data-path discovery. Other approaches which have similar methods (to verify the forwarding rules) are presented in [19, 20].

NetSight [21] collects and stores information from network packets to build so-called “packet histories”; later, such information can be used to compute the packet’s “trace”. However, we note that if the test cases are not actively generated / executed, no guarantees can be provided, the conclusion about the data-paths depend on the observed traffic.

An interesting work used to verify certain properties over the data-plane configuration is presented in [5]. The presented tool collects data-plane information and network invariants, and converts them into Boolean expressions. Later the SAT (Boolean satisfiability) problem is used to check that the invariants hold over the data-plane configuration. Again, this approach is not focused on data-path discovery. Another similar work used to verify the data-plane against certain properties is presented in [22].

To the best of our knowledge, there are no works that focus on the discovery of data-paths in SDN data-planes which provide guarantees. Furthermore, most of the existing approaches focus on static analysis, which can be ineffective for certain configurations, e.g., forwarding rules to the controller.

VI Conclusion and future work

In this paper, an approach for identifying the operational data-paths in SDN data-planes has been presented; the approach is based on distributed test case generation and monitoring. We have shown the conditions when the execution of a test suite (a set of test cases, i.e., network packets to be generated at network hosts) are necessary and sufficient to observe all implemented data-paths. Furthermore, we have developed a set of tools that implement the proposed approach, and experimentally shown that it can be useful for discovering real data-paths installed by SDN applications, and how sometimes, undesired data-paths appear in the data-plane.

This work opens a number of directions that we plan to address in the future. For instance, studying the trade-off between the level of access and the guarantees given. Likewise, it is interesting to study the problem with different assumptions, e.g., regarding the reliability of packet transmission or the synchronized clocks within the forwarding devices. Additionally, it is important to further study the verification that can be performed after the data-paths are discovered; for example, reporting functional errors, repairing the forwarding rule configuration, etc.

Acknowledgment

The results obtained in this work were partially funded by the Celtic-Plus European project SENDATE, ID C2015/3-1. The authors would like to thank (in no particular order) Dr. Natalia Kushik from Télécom SudParis, as well as the ex-students Takeyuki Koyama and Yohann Lallier from the same institution, and to Prof. Nina Yevtushekno, Dr. Igor Burdonov, Dr. Alexandre Kossatchev from the Russian Academy of Sciences for fruitful discussions that led to the results obtained in this paper.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Opennetworking, “Software-defined networking: The new norm for networks,” ONF White Paper , 2012. [Online]. Available: https://www.opennetworking.org
2[2] J. Wanderer, “Case study: The google sdn wan,” Computing. co. uk , vol. 11, 2013.
3[3] A. Berriri, J. López, N. Kushik, N. Yevtushenko, and D. Zeghlache, “Towards model based testing for software defined networks,” in Proceedings of the 13th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE, 2018. , 2018, pp. 440–446.
4[4] N. Yevtushenko, I. Burdonov, A. Kossachev, J. López, N. Kushik, and D. Zeghlache, “Test derivation for the software defined networking platforms: Novel fault models and test completeness,” in 2018 IEEE East-West Design Test Symposium (EWDTS) , Sep. 2018, pp. 1–6.
5[5] H. Mai, A. Khurshid, R. Agarwal, M. Caesar, P. B. Godfrey, and S. T. King, “Debugging the data plane with anteater,” in Proceedings of the ACM SIGCOMM 2011 Conference , ser. SIGCOMM ’11, New York, NY, USA, 2011, pp. 290–301.
6[6] J. López, S. Maag, and G. Morales, “Behavior evaluation for trust management based on formal distributed network monitoring,” World Wide Web , vol. 19, no. 1, pp. 21–39, 2016.
7[7] P. Congdon, “Link layer discovery protocol and mib,” V 1. 0 May , vol. 20, no. 2002, pp. 1–20, 2002.
8[8] O. Foundation, “Openflow switch specification v.1.5. 0,” 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Identifying Operational Data-paths in Software Defined Networking Driven Data-planes

Abstract

Index Terms:

I Introduction

II Background

II-A Software Defined Networking

II-B System testing

III Analyzing SDN data-planes

III-A Basic concepts

Definition 1**.**

Definition 2**.**

Definition 3**.**

III-B Identifying operational data-paths

Definition 4**.**

Definition 5**.**

Proposition 1**.**

Proof.

Proposition 2**.**

Corollary 1**.**

Proposition 3**.**

Proposition 4**.**

Proof.

IV Experimental results

Tools description

Experimental setup

Experiments with the ONOS Reactive Forwarding Application

Performance Evaluation

V Related work

VI Conclusion and future work

Acknowledgment

Definition 1.

Definition 2.

Definition 3.

Definition 4.

Definition 5.

Proposition 1.

Proposition 2.

Corollary 1.

Proposition 3.

Proposition 4.