A Sequential Composition Framework for Coordinating Multi-Robot   Behaviors

Pietro Pierpaoli; Anqi Li; Mohit Srinivasan; Xiaoyi Cai; Samuel; Coogan; and Magnus Egerstedt

arXiv:1907.07718·cs.RO·March 4, 2020

A Sequential Composition Framework for Coordinating Multi-Robot Behaviors

Pietro Pierpaoli, Anqi Li, Mohit Srinivasan, Xiaoyi Cai, Samuel, Coogan, and Magnus Egerstedt

PDF

TL;DR

This paper introduces a distributed framework using finite-time convergence control barrier functions to coordinate multi-robot behaviors, ensuring communication constraints are met during task sequences in complex environments.

Contribution

It develops a novel distributed control framework that enables multi-robot systems to sequentially compose behaviors while respecting local communication constraints.

Findings

01

Successfully coordinated 8 robots in urban exploration and rescue tasks.

02

Demonstrated finite-time convergence to desired configurations for behavior transitions.

03

Validated the framework's effectiveness in real-world multi-robot scenarios.

Abstract

A number of coordinated behaviors have been proposed for achieving specific tasks for multi-robot systems. However, since most applications require more than one such behavior, one needs to be able to compose together sequences of behaviors while respecting local information flow constraints. Specifically, when the inter-agent communication depends on inter-robot distances, these constraints translate into particular configurations that must be reached in finite time in order for the system to be able to transition between the behaviors. To this end, we develop a distributed framework based on finite-time convergence control barrier functions that enables a team of robots to adjust its configuration in order to meet the communication requirements for the different tasks. In order to demonstrate the significance of the proposed framework, we implemented a full-scale scenario where a team…

Equations97

C = {x \in D ∣ h (x) \geq 0} .

C = {x \in D ∣ h (x) \geq 0} .

\overset{α}{ˉ}_{ρ, γ} (h (x)) = γ \cdot sign (h (x)) \cdot ∣ h (x) ∣^{ρ},

\overset{α}{ˉ}_{ρ, γ} (h (x)) = γ \cdot sign (h (x)) \cdot ∣ h (x) ∣^{ρ},

\overset{x}{˙} = f (x) + g (x) u

\overset{x}{˙} = f (x) + g (x) u

\sup_{u\in U}\bigg{\{}L_{f}h(x)+L_{g}h(x)u+\bar{\alpha}_{\rho,\gamma}(h(x))\bigg{\}}\geq 0\quad\forall x\in\mathcal{D},

\sup_{u\in U}\bigg{\{}L_{f}h(x)+L_{g}h(x)u+\bar{\alpha}_{\rho,\gamma}(h(x))\bigg{\}}\geq 0\quad\forall x\in\mathcal{D},

K (x) = {u \in U ∣ L_{f} h (x) + L_{g} h (x) u + \overset{α}{ˉ}_{ρ, γ} (h (x)) \geq 0} .

K (x) = {u \in U ∣ L_{f} h (x) + L_{g} h (x) u + \overset{α}{ˉ}_{ρ, γ} (h (x)) \geq 0} .

U (x) \in K (x) \forall x \in D,

U (x) \in K (x) \forall x \in D,

T \leq \frac{1}{γ ( 1 - ρ )} ∣ h (x_{0}) ∣^{1 - ρ} .

T \leq \frac{1}{γ ( 1 - ρ )} ∣ h (x_{0}) ∣^{1 - ρ} .

E (t) = {(i, j) \in V \times V ∣ ∥ x_{i} (t) - x_{j} (t) ∥ \leq Δ} .

E (t) = {(i, j) \in V \times V ∣ ∥ x_{i} (t) - x_{j} (t) ∥ \leq Δ} .

\overset{x}{˙} = f (x) + g (x) u

\overset{x}{˙} = f (x) + g (x) u

π = {B_{1}, \dots, B_{M}} .

π = {B_{1}, \dots, B_{M}} .

B_{k} = {U_{k}, G_{k}},

B_{k} = {U_{k}, G_{k}},

G (t) \supseteq {G_{k} G_{k} \cup G_{k + 1} t \in [t_{k}^{⊢}, t_{k}^{⊣}] t \in (t_{k}^{⊣}, t_{k + 1}^{⊢}) \forall k = 1, \dots, M - 1.

G (t) \supseteq {G_{k} G_{k} \cup G_{k + 1} t \in [t_{k}^{⊢}, t_{k}^{⊣}] t \in (t_{k}^{⊣}, t_{k + 1}^{⊢}) \forall k = 1, \dots, M - 1.

η (σ) = {1 if G_{k} \subseteq G (t) 0 o.w.

η (σ) = {1 if G_{k} \subseteq G (t) 0 o.w.

h_{ij}^{c} (x) = Δ^{2} - ∥ x_{i} - x_{j} ∥^{2},

h_{ij}^{c} (x) = Δ^{2} - ∥ x_{i} - x_{j} ∥^{2},

C_{ij}^{c}

C_{ij}^{c}

C_{k}^{c}

K_{k}^{c} (x) = {u \in U ∣ \dot{h}_{ij}^{c} (x) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x)) \geq 0, \forall (i, j) \in E_{k}}

K_{k}^{c} (x) = {u \in U ∣ \dot{h}_{ij}^{c} (x) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x)) \geq 0, \forall (i, j) \in E_{k}}

T_{k} = (i, j) \in E_{k} ∣ h_{ij}^{c} (x_{0}) < 0 max {\frac{1}{γ ( 1 - ρ )} ∣ h_{ij}^{c} (x_{0}) ∣^{1 - ρ}} .

T_{k} = (i, j) \in E_{k} ∣ h_{ij}^{c} (x_{0}) < 0 max {\frac{1}{γ ( 1 - ρ )} ∣ h_{ij}^{c} (x_{0}) ∣^{1 - ρ}} .

T_{ij} \leq \frac{1}{γ ( 1 - ρ )} ∣ h_{ij}^{c} (x_{0}) ∣^{1 - ρ} .

T_{ij} \leq \frac{1}{γ ( 1 - ρ )} ∣ h_{ij}^{c} (x_{0}) ∣^{1 - ρ} .

T_{k} = (i, j) \in E_{k} ∣ h_{ij}^{c} (x_{0}) < 0 max T_{ij} .

T_{k} = (i, j) \in E_{k} ∣ h_{ij}^{c} (x_{0}) < 0 max T_{ij} .

u^{*} = u \in U arg min ∥ \overset{u}{^}_{k - 1} - u ∥^{2}

u^{*} = u \in U arg min ∥ \overset{u}{^}_{k - 1} - u ∥^{2}

L_{f} h_{ij}^{c} + L_{g} h_{ij}^{c} u + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c}) \geq 0,

L_{f} h_{ij}^{c} + L_{g} h_{ij}^{c} u + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c}) \geq 0,

L_{f} h_{ij}^{c} + L_{g} h_{ij}^{c} u + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c}) \geq 0,

L_{f} h_{ij}^{c} + L_{g} h_{ij}^{c} u + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c}) \geq 0,

C_{k}^{s} = {x \in D ∣ h_{ℓ}^{s} (x) \geq 0, \forall ℓ = 1, \dots, s_{k}} .

C_{k}^{s} = {x \in D ∣ h_{ℓ}^{s} (x) \geq 0, \forall ℓ = 1, \dots, s_{k}} .

K_{k}^{s} (x) = {u \in U ∣ \dot{h}_{ℓ}^{s} (x) + \overset{α}{ˉ}_{ρ, γ} (h_{ℓ}^{s} (x)) \geq 0, \forall ℓ = 1, \dots, s_{k}} .

K_{k}^{s} (x) = {u \in U ∣ \dot{h}_{ℓ}^{s} (x) + \overset{α}{ˉ}_{ρ, γ} (h_{ℓ}^{s} (x)) \geq 0, \forall ℓ = 1, \dots, s_{k}} .

K_{k} (x) = K_{k}^{c} (x) ⋂ K_{k}^{s} (x) .

K_{k} (x) = K_{k}^{c} (x) ⋂ K_{k}^{s} (x) .

\overset{x}{˙} = \overset{ˉ}{f} (x_{i}) \otimes 1_{n} + (\overset{g}{ˉ} (x_{i}) \otimes I_{n}) u_{1} ⋮ u_{n},

\overset{x}{˙} = \overset{ˉ}{f} (x_{i}) \otimes 1_{n} + (\overset{g}{ˉ} (x_{i}) \otimes I_{n}) u_{1} ⋮ u_{n},

\dot{h}_{ij}^{c} (x_{i}, x_{j}) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x_{i}, x_{j})) \geq 0.

\dot{h}_{ij}^{c} (x_{i}, x_{j}) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x_{i}, x_{j})) \geq 0.

\dot{h}_{ij}^{c} (x_{i}, x_{j}) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x_{i}, x_{j})) \geq 0 \forall j \in N_{k}^{i},

\dot{h}_{ij}^{c} (x_{i}, x_{j}) + \overset{α}{ˉ}_{ρ, γ} (h_{ij}^{c} (x_{i}, x_{j})) \geq 0 \forall j \in N_{k}^{i},

K_{k}^{c, i} = j \in N_{k}^{i} ⋂ K_{k, ij}^{c, i}

K_{k}^{c, i} = j \in N_{k}^{i} ⋂ K_{k, ij}^{c, i}

K_{k, ij}^{c, i} = {u_{i} \in U_{i} ∣ L_{\overset{ˉ}{f}} h_{ij}^{c} + L_{\overset{g}{ˉ}} h_{ij}^{c} u_{i} + \frac{α ˉ _{ρ, γ} ( h _{ij}^{c} )}{2} \geq 0},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Sequential Composition Framework for Coordinating Multi-Robot Behaviors

Pietro Pierpaoli, Anqi Li, Mohit Srinivasan, Xiaoyi Cai, Samuel Coogan, and Magnus Egerstedt This work was supported by DARPA Grant No. N66001-17-2-4059. Pietro Pierpaoli, Mohit Srinivasan, Samuel Coogan, and Magnus Egerstedt are with the School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA. (email: {pietro.pierpaoli,mohit.srinivasan,sam.coogan,magnus}@gatech.edu) Anqi Li is with the department of Computer Science, University of Washington, Seattle, WA 98195, USA. (email:[email protected]) Xiaoyi Cai is with the department of Aeronautics and Astronautics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. (email: [email protected]) This paper has supplementary downloadable material available at http://ieeexplore.ieee.org, provided by the authors. This includes a multimedia MP4 format movie (60.1 MB), which shows commented implementation of the ”Securing a Building” mission on a team of differential drive robots.

Abstract

A number of coordinated behaviors have been proposed for achieving specific tasks for multi-robot systems. However, since most applications require more than one such behavior, one needs to be able to compose together sequences of behaviors while respecting local information flow constraints. Specifically, when the inter-agent communication depends on inter-robot distances, these constraints translate into particular configurations that must be reached in finite time in order for the system to be able to transition between the behaviors. To this end, we develop a distributed framework based on finite-time convergence control barrier functions that enables a team of robots to adjust its configuration in order to meet the communication requirements for the different tasks. In order to demonstrate the significance of the proposed framework, we implemented a full-scale scenario where a team of eight planar robots explore an urban environment in order to localize and rescue a subject.

Index Terms:

Multi-Robot Systems, Networked Robots, Control Barrier Functions, Behavior-Based Systems

I Introduction

As our understanding of how to structure control and coordination protocols for teams of robots increases, a number of application domains have been identified, such as entertainment [1][2], surveillance [3] [4], manipulation [5], and search-and-rescue [6]. Along with a decrease in the production and manufacturing costs associated with the platforms themselves, these applications have been enabled by a number of theoretical results that have emerged at the intersection of different disciplines such as robotics, controls, computer science, and graph theory [7].

From a motion controls perspective, one notable requirement is given by the need to define actions capable to solve team-wise objectives on the basis of locally available information. For instance, different extensions of the consensus equation have been used to arrive at locally defined controllers with provable, global convergence properties [8]. In this way, it is possible to construct coordinated controllers for the solution of motion control problems, such as rendezvous [9] [10], cyclic pursuit [11], formation control [12] [13], coverage [14] [3], leader-based control [15], and flocking [16]. Particular instantiations of some of these behaviors are shown in Fig. 1 on a group of six simulated differential drive robots.

For the correct execution of the controllers mentioned, a sufficiently rich set of information needs to be available to the robots. Representing the flow of information between the robots through graphs, with vertices and edges being respectively the robots and the pair-wise ability of sharing information, those conditions can be encoded in terms of particular graphs that need to exist between the robots. For example, rendezvous requires a spanning out-branching tree [15], cyclic-pursuit requires a cyclic graph [11], formation control a rigid graph [15], and a Delaunay graph is required for most of coverage control problems [14].

Even though the coordinated behaviors mentioned above can address a number of different tasks, they have limited utility in the context of real-world missions, which can rarely be represented as single tasks. However, the utility of these behaviors can be greatly expanded if they are sequenced together, which is the primary consideration in this paper. But, for a construction like this to work, it is necessary that the required information is available to the robots as they transition from one behavior to the next.

As such, the problem of composing different behaviors, can be recast in terms of the ability of the robots to establish the interactions needed at each stage of a mission. In particular, when the communication between agents depends on their relative configurations (e.g. relative distance or orientation), realizing a certain communication structure directly affects the configuration of the system, which in turn, affects the execution of the mission itself. In order to overcome this coupling, we separate the problem of generating a sequence of behaviors that corresponds to the solution of a mission objective (e.g. [17]) from their composition. In this work we focus on the problem of designing a composition framework given a sequence of coordinated behaviors. Although the focus of this paper is confined to motion control tasks, our framework is applicable to other forms of autonomous collaboration where desired interaction structures between the robots are required by the mission, e.g., sharing of resources in heterogeneous teams [18] or coordinated manipulation [19].

The contribution of this paper is twofold. Firstly, extending the results in [20], we propose a fully decentralized framework for composing a given sequence of multi-robot coordinated behaviors. Secondly, responding to the lack of established large-scale scenarios for the testing of multi-robot techniques, we propose a scenario called Securing a Building, which is rich and complex enough to capture many challenges and objectives of real-world implementations. The significance of our framework is demonstrated through implementation of the Securing a Building scenario on a team of mobile robots.

The remaining of this paper is organized as follows. In Section III we review the definition of finite-time convergence barrier functions, while in Section IV we present a centralized multi-robot composition framework, which is extended to a fully decentralized formulation in Section VI. In Section VII, we describe the Securing a Building case study and its implementation. Finally, motivated by the lack of well-established scenarios for testing and comparing multi-agent robotics techniques, in Appendix A we discuss supportive arguments for considering the Securing a Building as a multi-robot benchmark scenario.

II Related Work

The problem of partitioning complex objectives into simpler tasks can be solved by sequentially composing primitives, e.g., [21], or by blending them simultaneously in a hierarchical fashion. An example of hierarchical composition for single robot motion control is navigation between points while avoiding obstacles, e.g., [22]. In general, the problem of controlling a system by composing different modes of operation pertains to hybrid systems and multi-modal control domains [23].

Because of the complexity emerging from the composition of distinct controllers, guarantees on the safety and correctness of the final results need to be established [24]. Provable correct composition of control policies is investigated in the formal methods literature. Recently, compositional strategies inspired from formal methods have been used for the development of control strategies for multi-robot systems [25, 26, 27, 28]. In particular, the authors of [25] use tools from linear temporal logic (LTL) for the specification of behaviors to be executed by the system. The solution is based on a sequence of constrained reachability problems, each consisting of a target set to be reached in finite time and a safety set within which the system must stay at all times. A related approach is developed in [26], where the problem of prescribed-time convergence to spatio-temporal specifications is formulated using control barrier functions. The authors in [27] discuss a hierarchical decomposition method for controller synthesis given LTL specifications.

In the context of controllers composition for multi-robot systems, in [29] the authors use symbolic methods in order to generate high-level instructions from form of human-like language. The authors of [30] introduce a framework for the composition of controllers in robotic systems using Petri Nets. In [31], behaviors from the Null-Space-Behaviors framework are combined in order to solve ad-hoc tasks, such as perimeter patrol. A supervisor, represented as a finite state automata, selects high-level behaviors by assembling low-level behaviors. In [17], a revised version of the $A^{*}$ algorithm is used to generate an optimal path of behaviors, such that the overall cost of the mission is minimized. Similarly, in [32] motion planning for a team of quadcopters is solved by defining higher level motion primitives obtained by a spatial partition of the environment. However, none of these approaches specifically address the problem of correct composition between primitives, which is the focus of this paper.

As discussed in the previous section, coordination between agents is possible only if particular interactions exist between the robots. In multi-robot systems, interaction requirements are commonly investigated in terms of connectivity maintenance, i.e., a certain graph or node-connectivity needs to be guaranteed at all times. Methods employed in the solution to this problem include edge weight functions [33], control rules based on estimate of algebraic connectivity [34], hybrid control [35], passivity [36], and barrier functions [37]. If connectivity between agents needs to be guaranteed in non-nominal circumstances, resilient solutions must be in place as well, e.g., [18], [38], and [39]. Notably, a technique based on graph process specifications for the sequential composition of different multi-agent controllers is discussed in [40]. Similar to our work, the authors in [40] bridge the gap between composition of controllers and the topology requirements by encoding requisites for each controller in terms of graphs. However, while in [40] incompatible controllers are combined through the introduction of a bridging controller, in our approach controllers are minimally modified by the robots in order to satisfy upcoming requirements. Our approach significantly reduces the complexity of the composition process, minimizes the energy spent by the robots to switch between behaviors, and can accommodate additional constraints, such as inter-robot collisions and obstacles avoidance.

III Finite-Time Barrier Functions

In this section we review the general definition of Finite-time Convergence Control Barrier Function (FCBF) which was first introduced in [20] and inspired by the finite-time stability analysis for autonomous system introduced in [41]. Given a dynamical system operating in an open set $\mathcal{D}\subseteq\mathbb{R}^{n}$ and a set $\mathcal{C}\subset\mathcal{D}$ , barrier functions [42] are Lyapunov-like functions that guarantee forward invariance of $\mathcal{C}$ with respect to the state of the system. In other words, if an appropriate barrier function exists, it can be used to show that if the state of a system is in $\mathcal{C}$ at some time, it will be in $\mathcal{C}$ thereafter. The concept of barrier functions was extended to Zeroing Control Barrier Functions (ZCBF) in [42], where asymptotic convergence of the state to the set $\mathcal{C}$ was discussed. Thus, provided that an appropriate ZCBF exists, if the state of the system is not in $\mathcal{C}$ at some initial time, it will asymptotically converge to $\mathcal{C}$ .

As discussed in the introduction, before execution of a coordinated behavior, robots need to satisfy certain spatial configurations imposed by the behavior itself. Importantly asymptotic convergence to the correct configuration is not sufficient. In fact, if we consider $\mathcal{C}$ as the joint set of all initial configurations required for a particular behavior, the state must strictly belong to $\mathcal{C}$ for the behavior to work properly. Following this observation, the need for a finite-time convergence extension of the previous concepts becomes clear. In particular, denoting the state of the system as $x(t)\in\mathcal{D}$ , we are interested in verifying the following conditions:

•

if $x(t_{0})\in\mathcal{C}$ , then $x(t)\in\mathcal{C}$ for all $t>t_{0}$

•

if $x(t_{0})\notin\mathcal{C}$ , then $x(t)\in\mathcal{C}$ for some $t_{0}<t<\infty$ .

In order to do this, we encode the set $\mathcal{C}\subset\mathcal{D}\subseteq\mathbb{R}^{n}$ , through the superzero-level set of a continuously differentiable function $h:\mathcal{D}\rightarrow\mathbb{R}$ , i.e.,

[TABLE]

Definition III.1

We introduce the following class- $\mathcal{K}$ function

[TABLE]

with $\rho\in[0,1)$ and $\gamma>0$ , which is continuous everywhere and locally Lipschitz everywhere except at the origin [41].

Definition III.2

[20]** For a dynamical system

[TABLE]

with $x\in\mathcal{D}$ , $u\in U\subset\mathbb{R}^{m}$ , and for a set $\mathcal{C}$ induced by $h$ , if there exists a function $\bar{\alpha}_{\rho,\gamma}(h(x))$ of the form (2) such that

[TABLE]

then, the function $h$ is a Finite-time Convergence Barrier Function (FCBF) defined on $\mathcal{D}$ .

Following from the definition above, we define the set of admissible control inputs as

[TABLE]

Theorem III.3

[20]** Given a set $\mathcal{C}\subset\mathbb{R}^{n}$ , any Lipschitz continuous controller $\mathcal{U}:\mathcal{D}\mapsto U$ such that

[TABLE]

renders $\mathcal{C}$ forward invariant for the system (3). Moreover, given an initial state $x_{0}\in\mathcal{D}\backslash\mathcal{C}$ , the same controller $\mathcal{U}$ results in $x(T)\in\mathcal{C}$ , where

[TABLE]

In conclusion, by selecting a controller that verifies condition (6), both forward invariance and finite-time convergence to the desired set are guaranteed.

IV Problem Formulation

We denote the state of a team of $n$ homogeneous mobile robots operating in a $d$ -dimensional and connected domain $\mathcal{D}$ as $x(t)=[x_{1}(t)^{T},\dots,x_{n}(t)^{T}]^{T}\in\mathcal{D}\subset$ $\mathbb{R}^{dn}$ where $x_{i}(t)\in\mathbb{R}^{d}$ is the position of robot $i$ at time $t$ . As part of the coordinated nature of the behaviors being performed by the robots, each robot executes a control protocol which depends on the state of the subset of robots with which it interacts. We assume robots can communicate if the distance between them is less or equal to a sensing threshold $\Delta\in\mathbb{R}_{>0}$ . Thus, the list of possible interactions between agents are described by a time-varying, undirected, proximity graph $\mathcal{G}(t)=(V,E(t))$ , where $V=\{1,\dots,n\}$ is the set of nodes representing the robots and $E(t)$ is the set of interacting pairs at time $t$ , where

[TABLE]

For each robot $i=1,\dots,n$ , we denote the set of available neighbors at time $t$ as $\mathcal{N}_{i}(t)=\{j\in V\,|\,(i,j)\in E(t)\}$ , which depends on the position of the robots at time $t$ .

The ensemble dynamics of the multi-agent system is described by

[TABLE]

where $f$ and $g$ are continuous locally Lipschitz continuous functions and $u=[u_{1}^{T},\dots,u_{n}^{T}]^{T}\in U\subset\mathbb{R}^{m}$ is the vector of inputs, which depends on the particular behavior being executed. At all times, the control input $u$ in (9) is given by a controller $\mathcal{U}$ , which can be defined as a state feedback law $\mathcal{U}:\mathcal{D}\mapsto U$ or by a combination of both external parameters and state feedback law $\mathcal{U}:\mathcal{D}\times\Theta\mapsto U$ , where $\Theta$ is a space of parameters appropriate for the behavior. For instance, the controller corresponding to a weighted consensus belongs to the first case. On the other side, a leader-follower protocol where followers maintain prescribed inter-agent distances is described by a controller that depends on both state feedback (followers’ control) and exogenous parameters (leader’s goal) (see Section VI for examples).

We represent a mission by an ordered sequence of $M$ coordinated behaviors

[TABLE]

The $k^{\text{th}}$ behavior in $\pi$ is defined by the pair

[TABLE]

where $\mathcal{U}_{k}$ represents the coordinated controller described above and $\mathcal{G}_{k}$ is the interaction graph required by behavior $\mathcal{B}_{k}$ to function properly. We assume the list of behaviors $\pi$ to be fixed and available to all robots. We will use the term behavior to refer to a generalized multi-robot controller in the form (11) and to task as the objective of the controller.

As discussed in Section I, each behavior requires a certain interaction structure between the robots (i.e., pairs of robots that need to be neighbors). With reference to (11), we describe an interaction structure via the graph $\mathcal{G}_{k}=(V,E_{k})$ . Thus, denoting by $t_{k}^{\vdash}$ and $t_{k}^{\dashv}$ the starting and ending times for behavior $k$ , the robots’ configuration needs to satisfy $\mathcal{G}_{k}\subseteq\mathcal{G}(t)$ for all $t\in[t_{k}^{\vdash},t_{k}^{\dashv}]$ . In other words, as shown in Fig. 2, the interaction structure required by each behavior needs to be a spanning graph of the graph induced by the state of the agents during the interval of time the behavior is executed. At this point, given a list of behaviors constituting the mission $\pi$ and the corresponding multi-robot controllers, we want to design a procedure that enables robots to assemble and maintain the communication graph required by each behavior.

Problem IV.1

Given an ordered sequence of coordinated behaviors $\pi=\{\mathcal{B}_{1},\dots,\mathcal{B}_{M}\}$ , where each $\mathcal{B}_{k}=\{\mathcal{U}_{k},\,\mathcal{G}_{k}\}$ can be completed by the robots in finite-time, design a feedback control policy to compose the behaviors such that

[TABLE]

V Composition of Coordinated Behaviors

In addition to the list $\pi$ , transitions between behaviors need to be synchronized, i.e., for each behavior $\mathcal{B}_{k}$ , $k=1,\dots,M$ , robots must 1) start assembling $\mathcal{G}_{k+1}$ only after all robots have completed $\mathcal{B}_{k}$ and 2) start executing $\mathcal{B}_{k+1}$ only after condition $\mathcal{G}_{k+1}\subseteq\mathcal{G}(t)$ is satisfied. We assume the existence of a discrete counter $\sigma\in[1,\dots,M]$ which indicates the active behavior and a binary signal

[TABLE]

which describes whether the interaction structure required by behavior $\mathcal{B}_{\sigma}$ is available. In this section, we assume both signals to be controlled by a supervisor and made available to the robots at all times, e.g., through a dedicated static communication network. In the next section, we discuss the extension to a fully distributed framework.

Following from the communication modality assumed for the robots, communication constraints can be expressed in terms of relative distance between the robots. In other words, behavior $\mathcal{B}_{k}$ can be correctly executed if, for all $t\in[t_{k}^{\vdash},t_{k}^{\dashv}]$ , all the distances between pairs in $E_{k}$ are below the proximity threshold $\Delta$ . To this end, a convenient pair-wise connectivity FCBF can be defined as

[TABLE]

and we note that if $\|x_{i}-x_{j}\|\leq\Delta$ , then $h_{ij}^{c}(x)\geq 0$ . In addition, the edge-level and ensemble-level connectivity constraint sets for behavior $\mathcal{B}_{k}$ are

[TABLE]

Following the definition given in (5), the admissible set of control inputs that guarantees finite-time convergence to $\mathcal{C}^{c}_{k}$ is:

[TABLE]

Theorem V.1

Denoting with $x_{0}$ the initial state of the system with dynamics (9), any controller $\mathcal{U}:\mathcal{D}\mapsto U$ such that $\mathcal{U}(x_{0})\in K_{k}^{c}(x_{0})$ for all $x_{o}\in\mathcal{D}$ , will drive the system to $\mathcal{C}^{c}_{k}$ within time

[TABLE]

Proof:

Consider all pairs of agents $i$ and $j$ , such that $(i,j)\in E_{k}$ . If $h_{ij}^{c}(x_{0})\geq 0$ , i.e., agents $i$ and $j$ are within communication distance, the forward invariance property of $\mathcal{U}$ , guarantees that $i$ and $j$ will stay connected. In this case, the state will reach $\mathcal{C}_{ij}^{c}$ , within time $T_{ij}=0$ . On the other side, consider $h_{ij}^{c}(x_{0})<0$ . Any $\mathcal{U}(x_{0})\in K_{k}^{c}(x_{0})$ satisfies the finite-time convergence barrier certificates, and because of Theorem III.3, if $x_{0}\notin\mathcal{C}_{ij}^{c}$ , then $x(T_{ij})\in\mathcal{C}_{ij}^{c}$ , with

[TABLE]

Since every communication constraint $\mathcal{C}_{ij}^{c}$ will be reached within time $T_{ij}$ , the total time required to drive $x(t)$ to $\mathcal{C}_{k}^{c}$ is upper bounded by

[TABLE]

∎

When selecting control inputs from set (17), the system (9) will satisfy requirements for behavior $\mathcal{B}_{k}$ in finite time.

V-A Finite-Time Convergence Control Barrier Functions

Once behavior $\mathcal{B}_{k-1}$ is completed, robots are required to converge to the set $\mathcal{C}^{c}_{k}$ before behavior $\mathcal{B}_{k}$ can start. Under the lead of the external supervisor, the change of behavior is communicated to the robots through the signal $\sigma$ , which transitions from $k-1$ to $k$ once $\mathcal{B}_{k-1}$ is completed. Now, although finite-time convergence to $\mathcal{C}^{c}_{k}$ can be achieved by selecting any control input in $K_{k}^{c}(x)$ , we seek to minimally perturb the execution of the behavior just concluded, namely $\mathcal{B}_{k-1}$ . This can be accomplished by solving a problem similar to the one proposed in [43], which we adapt to our framework. Denoting with $\hat{u}_{k}=\mathcal{U}_{k}(x)$ the nominal control input from behavior $\mathcal{B}_{k}$ , during transition between $\mathcal{B}_{k-1}$ and $\mathcal{B}_{k}$ the actual control input to the robots $u^{*}$ is defined as

[TABLE]

subject to

[TABLE]

for all $(i,j)\in E_{k-1}\cup E_{k}$ . Once all required edges $E_{k}$ are established (i.e., $\eta=1$ ), edges in $E_{k-1}$ are no longer necessary. At this point, under the effect of the controller $\mathcal{U}_{k}$ , the list of constraints in (22) is substituted with

[TABLE]

for all $(i,j)\in E_{k}$ . Since the cost function is convex and the inequality constraints (22) and (23) are control affine, the problem can be solved in real-time. In conclusion, because of the finite-time convergence and forward invariance properties of the above formulation, if $\mathcal{B}_{k-1}$ can be completed and a solution to (21-22) (or (21-23)) exists, robots will converge to the configuration required by $\mathcal{B}_{k}$ , and maintain it throughout its execution.

Remark V.1

The solution of (21-22) (or (21-23)) is contingent upon the existence of a control input capable to solve all constraints. In other words, $K_{k}^{c}(x)\cap K_{k+1}^{c}(x)$ (or $K_{k}^{c}(x)$ ) should not be empty for all times. For this, it is necessary that a robot’s configuration that satisfies all constraints of the problem exists. However, this is not sufficient as the progress towards the desired configuration might be obstructed by constraints on the actuators or deadlock configurations. Although we do not address this directly, it is possible to mitigate feasibility issues by considering, for example, constraints relaxation, sum of squares barrier functions, or pre-defined back-up controllers (see [44] and references therein).

V-B Initial Constraints

In addition to the communication constraints considered above, certain missions might require additional conditions to be met before each behavior can start. For example, during the exploration tasks it might be desirable for one robot to always stay within range of communication with a human-operator, or to maintain a minimum distance from an unsafe area. Assuming $\mathcal{B}_{k}$ requires a number of distinct $s_{k}$ of such constraints, we encode the entire set of initial conditions through a list of barrier functions $h_{\ell}^{s}(x)$ , with $\ell=1,\dots,s_{k}$ :

[TABLE]

Following this definition, we define a set of admissible control inputs similar to the one in (17) that will drive the state of the system to the desired set within finite time:

[TABLE]

The set of controls satisfying both communication and initial conditions constraints can thus be obtained by intersection of set (25) and (17):

[TABLE]

We note that the results in Theorem V.1 and the formulation of minimally invasive controller in (21) still holds valid by considering the set $K_{k}(x)$ instead of $K_{k}^{c}(x)$ as the set of admissible control inputs.

VI Distributed Composition of Behaviors

The composition framework discussed in the previous section reduces to the team-wise minimum norm controller (21), which is not directly solvable by individual robots. In addition to this, a centralized supervisor is needed in order to synchronize behavior transitions. In this section, we formulate a decentralized solution to problem IV.1 which can be implemented by the robots using only information from their neighbors. Furthermore, we also include those additional constraints necessary for the safe operations of the robots, e.g., inter-agent collisions and obstacles avoidance [37]. The formulation is derived following the approach described in [45], which we adapt here to our framework.

VI-A Distributed Finite-Time Convergence Control Barrier Functions

The limitation in solving problem (21) in a distributed fashion is represented by the fact that knowledge of dynamics, input $\hat{u}$ , and state $x$ for the entire team need to be available. In addition, solution of (21), provides the control inputs for the entire team, which are unnecessary to the individual robots.

In order to develop the correct decentralized formulation of (21), we first define a decomposition of the dynamics (9). We denote by $\mathcal{D}_{i}\subset$ $\mathbb{R}^{d}$ and $U_{i}\subset$ $\mathbb{R}^{m}$ configuration space and set of feasible controls for agent $i$ respectively. In addition, by denoting with $\bar{f},\,\bar{g}:\mathcal{D}_{i}\mapsto$ $\mathbb{R}^{d}$ the node-level terms of the control affine dynamics of agent $i$ , the ensemble dynamics can be written as:

[TABLE]

where $u_{i}\in U_{i}$ is the $i^{\text{th}}$ robot’s control input, $\otimes$ is the Kronecker product, and ${\bf 1}_{n}$ and $I_{n}$ are vector of ones and identity matrix of size $n$ respectively.

Let’s consider two sequential behaviors $\mathcal{B}_{k-1}$ and $\mathcal{B}_{k}$ . Upon completion of $\mathcal{B}_{k-1}$ , for all edges $(i,j)\in E_{k}$ , robots’ configuration should satisfy

[TABLE]

From the $i^{\text{th}}$ robot’s point of view, the set of constraints that need to be satisfied in order to execute the new behavior are

[TABLE]

where we recall that $\mathcal{N}_{k}^{i}$ is the set of neighbors to robot $i$ required by behavior $\mathcal{B}_{k}$ . However, since constraint (29) appears exactly twice across the team of robots, it can be relaxed by considering the admissible set of control inputs

[TABLE]

with

[TABLE]

where dependence from $x_{i}$ and $x_{j}$ is omitted for clarity.

Theorem VI.1

Denoting with $x_{0}=[x_{0,1}^{T},\dots,x_{0,n}^{T}]^{T}$ the initial state of a multi-agent system with dynamics described as in (27), any controller $\mathcal{U}_{i}:\mathcal{D}_{i}^{|\mathcal{N}_{k}^{i}|}\mapsto U_{i}$ such that $\mathcal{U}_{i}(x_{0})\in K_{k}^{c,i}$ for all $x_{0}\in\mathcal{D}_{i}^{|\mathcal{N}_{k}^{i}|}$ , will drive the ensemble state to $\mathcal{C}^{c}_{k}$ within time

[TABLE]

Proof:

From Theorem III.3, agents $i$ and $j$ , with $(i,j)\in E_{k}$ , will satisfy $h_{ij}^{c}\geq 0$ in finite time if

[TABLE]

Considering the node level dynamics in (27), the constraint (33) reduces to

[TABLE]

which will be satisfied if both agents $i$ and $j$ satisfy the constraint

[TABLE]

In addition, as discussed in Theorem V.1, constraint (34) will still be satisfied at time

[TABLE]

The same argument can be repeated for all pairs $(i,j)\in E_{k}$ , and condition $\mathcal{G}_{k}\subseteq\mathcal{G}(t)$ will be satisfied within time

[TABLE]

∎

Applying the same design principle described in Section V-A, the minimally invasive control action can be computed by each robot as

[TABLE]

subject to

[TABLE]

Similarly to constraint (22), once all edges in $E_{k}$ are available, constraint (39) is substituted with

[TABLE]

which remains active until $\mathcal{B}_{k}$ is completed.

We note that, in order for agent $i$ to respect (40), the only external information needed is the state of all current neighbors, i.e. $x_{j}$ for all $j\in\mathcal{N}^{i}_{k}$ . On the other side, in order to respect (39), robots need to have access to the state of the future neighbors. This requirement can be satisfied through a state estimation scheme (e.g. EKF [46]), which in turn requires knowledge of robots’ dynamics (known for homogeneous teams) or network localization techniques [47].

Remark VI.1

The ability of each robot to have access to an estimate of their future neighbors’ state does not eliminate the necessity of establishing neighborhood relationships. In fact, a certain proximity structure between robots might be required by desired controllers’ performance that cannot be met through state estimations, or by collaboration tasks that require physical interaction between the robots, e.g. collaborative manipulation [19], sharing of resources [18].

VI-B Additional Constraints

In addition to the proximity constraints discussed above, additional limitations might be imposed on the robots’ configuration by the mission and the environment. For illustrative purposes, we consider inter-robots collisions and obstacle avoidance. Following the approach described in [37], we encode each pair-wise separation condition through the following barrier certificate

[TABLE]

and the minimum separation $D_{a}$ between the robots is satisfied if $h_{ij}^{a}(x)\geq 0$ , for all physical neighbors $j\in\mathcal{N}^{i}(t)$ .

Similarly, avoidance of fixed obstacles can be introduce by considering $M$ ellipsoidal regions of the domain, described by centers $o=[o_{1}^{T},\dots,o_{M}^{T}]^{T}$ . For every agent-obstacle pair $(i,m)$ we define a pairwise barrier function as

[TABLE]

The object avoidance constraints are satisfied if $h_{im}^{o}(x)\geq 0$ , for all $i\in V$ and $m\in\{1,\dots,M\}=\mathcal{I}_{M}$ .

Collecting all the constraints, we expand the problem formulation in (21) to

[TABLE]

where $\alpha$ is a locally Lipschitz extended class- $\mathcal{K}$ function and the first constraint is replaced by (39) during transitions. In conclusion, if there exists a set of control inputs $u=[u_{1},\dots,u_{N}]$ that simultaneously satisfies all constraints in (44), for all behaviors $k=1,\dots,M$ , Problem IV.1 will be solved by the robots.

VI-C Decentralized Behaviors Sequencing

For the correct execution of the behaviors sequencing, each robot should start assembling a new graph only after all other robots have completed the current behavior. Similarly, a new behavior should start once all robots satisfy the neighbors’ requirements for it. Now, we describe a decentralized strategy that allows execution of these two transitions without the need of a supervisor, nor synchronization between the robots.

With reference to Fig. 3, at any given time, each robot’s mode of operation is described by a binary variable $\alpha_{i}$ that describes whether robot $i$ is assembling the graph for an upcoming behavior ( $\alpha_{i}=1$ ) or executing a behavior ( $\alpha_{i}=0$ ). Without loss of generality, assume robots’ initial configuration satisfies the communication requirements for the first behavior, which is then executed ( $\alpha_{i}=0$ ). Once all robots have completed the first behavior, they start assembling the graph required by the following one ( $\alpha_{i}=1$ ), while minimally perturbing the behavior just concluded. Once the new graph is satisfied $\mathcal{G}_{2}\subseteq\mathcal{G}(t)$ , robots start behavior $\mathcal{B}_{2}$ and exit from assembly mode ( $\alpha_{i}=0$ ). This process repeats, until no successive behavior exists.

A correct execution of this process requires robot to agree on when to perform transitions $\alpha_{i}=0\rightarrow 1$ and $\alpha_{i}=1\rightarrow 0$ . To this end, we take inspiration from the consensus-based algorithm described in [48] and we note that this choice is not central to the contribution of this paper. For each robot, we define a binary variable available only to robot $i$ , $s_{t,i}\in\{0,1\}$ that denotes whether robot $i$ itself has completed its current task $s_{t,i}=1$ ( $s_{t,i}=0$ if robot has not completed its current task). In addition, we introduce a variable $\sigma_{i}\in\mathbb{R}_{+}$ , shared among neighbors, continuously updated through the following consensus-based process

[TABLE]

where $\sigma_{i}^{+}$ represent the variable’s value after the update. Owing to the diffusion of $\sigma_{1},\dots,\sigma_{N}$ throughout the network, we can interpret $\sigma_{i}$ ’s as local measures of the team-wise completion of a task. As proved in [48], if $s_{t,i}=1$ for all $i=1,\dots N$ (i.e., all robots are capable to complete the current behavior), by following (45), $\lim_{t\rightarrow\infty}\sigma_{i}=1$ , for all $i=1,\dots N$ . Therefore, robot $i$ starts assembling a new communication graph once the value of $\sigma_{i}$ is close enough to $1$ (see [48] for a discussion on how to choose the switching threshold). A similar process is used for the transition $\alpha_{i}=1\rightarrow 0$ , where we replace $s_{t,i}$ and $\sigma_{i}$ with $s_{a,i}$ and $\eta_{i}$ respectively. The distributed sequencing procedure is summarized in Algorithm 1.

VI-D Applications

We implemented the distributed sequencing framework on the Robotarium [49], on a team of $5$ differential drive robots. For this example, controllers are designed assuming a single integrator model, i.e. $\bar{f}(x_{i})=[0,0]^{T}$ and $\bar{g}(x_{i})=I_{2}$ . In this example, robots execute a transition between two behaviors, where $\mathcal{B}_{1}$ is a cyclic-pursuit behavior and $\mathcal{B}_{2}$ is a formation assembly with leader. Cyclic-pursuit behavior is obtained through the following controller:

[TABLE]

where $R(\phi)\in SO(2)$ is the rotation matrix of angle $\phi$ , which is related to the desired cycle radius. Importantly, for this behavior to work, the communication graph $\mathcal{G}_{1}$ must be a cycle graph. Considering robot $1$ as leader, the formation control behavior can be achieved with

[TABLE]

where $\theta_{ij}\in\mathbb{R}_{+}$ is the desired inter-robot distance, $x_{g}\in\mathcal{D}$ is the leader’s goal, and $\gamma_{g}\in\mathbb{R}_{+}$ the corresponding proportional gain. In the case of formation control, it is known that the Euclidean embedding of $\mathcal{G}_{2}$ must be a rigid framework (see for instance [15] and references therein). With reference to Fig. 4, robots initially execute $\mathcal{B}_{1}$ for a certain amount of time (a). Once completed (b) (green patches represent robots that have completed their current behavior), robots start assembling $\mathcal{G}_{2}$ (c), after which $\mathcal{B}_{2}$ is executed until $\|\hat{u}_{i}\|$ is below a pre-defined threshold (d-f).

In Fig. 5 we can observe the value of the two consensus variables $\sigma_{i}$ and $\eta_{i}$ for all robots during the behavior transition. Background colors represent the time intervals during which the two behaviors were executed, while the darker region in the middle corresponds to the assembly of the new graph. We observe the assembly and task variables $\eta_{i}$ and $\sigma_{i}$ approaching the value $1$ simultaneously for all robots, thus triggering a synchronized start of the successive phase.

The robustness of our technique was tested by simulating uniformly distributed delays between the robots. Results for this case are shown in Fig. 6 where we observe that although convergence of $\eta_{i}$ and $\sigma_{i}$ is no longer monotonic, robots still reach agreement on when to switch to the successive phase.

Finally, in order to show the benefits of the minimally invasive approach, we compare it with an alternative technique inspired by [40], where, upon collective completion of a behavior, robots execute rendezvous until the communication graph required by the successive behavior is assembled. As shown by the simulation results for a sequence of $7$ behaviors (Fig. 7), the mean of the input’s norm when considering our framework (red solid line) is always lower than the one obtained using the rendezvous as glue behavior. Importantly, since transitions between behaviors occur faster in the minimally invasive case, the lower control effort cannot be attributed to a more relaxed choice of controller gains.

VII Case Study: Securing a Building

The objective of this section is to describe the Securing a Building mission, which will be used as testing scenario for the composition framework. We describe now the main structure and objective of the mission, while we deconstruct it into coordinated behaviors in the next subsection.

VII-A Mission Overview

In the Securing a Building mission, a group of robots are deployed in an urban environment to identify an unknown target building and rescue a subject located inside. Based on [50], we decompose this mission into the following 4 phases:

FIND - First, the robots are tasked with identifying the target building by means of surveillance of the perimeters of all the buildings. For efficient exploration, robots can be broken into sub-teams. Each team reports collected information at the base after each building has been investigated. Once the target building has been identified, the robots reunite and prepare for the next phase.

ISOLATE - The robots isolate the target building by patrolling its perimeter. To achieve this, the robots are divided into two subgroups - the security agents responsible for boundary protection and the maneuvering agents tasked with entering the building.

RESCUE - During the rescue phase, the security agents keep patrolling around the building. In the meanwhile, the maneuvering agents enter the building, clear the rooms, and seize positions as they maneuver through the building to find the subject to be rescued. Once the subject has been located, the robots transport it to the safe zone.

FOLLOW-THROUGH - As the interior of the building is being cleared, individual robots are left inside as beacons, while the remaining robots from the maneuvering agents leave the building, gather on the outside with the security agents, and report back to the base station.

A number of arguments support the choice of the Securing a Building mission as an ideal scenario for testing multi-robot techniques and algorithms. First, the requirement of spatially diverse functionalities that cannot be provided by single robots naturally requires the use of multi-robot systems. Second, the final goal of the mission, namely rescuing the subjects of interest, reflect the fact that general real-world missions cannot be accomplished with single controllers. Lastly, thanks to its modularity, techniques focusing on specific aspects of the mission can be integrated and tested without influencing the overall structure of the mission (see the Appendix for details).

VII-B Securing a Building Through Composition of Behaviors

We deconstruct the Securing a Building mission through ordered sequences of coordinated behaviors. The process is summarized in Fig. 8. We refer to behaviors in terms of their main objectives, acknowledging that different implementations can be used to achieve the same results. We highlight these behaviors in parenthesis.

FIND

Robots initially coordinate with the operator at the base station (rendezvous). After that, robots are divided into different search teams, each assigned with a list of buildings to investigate (task allocation). Subsequently, all the teams investigate their own lists of buildings. First team of robots travels to the vicinity of a building (leader-follower), then start to survey the exterior of the building (perimeter patrol), and return to the base (leader-follower). This process repeats until the target building is discovered.

ISOLATE

Robots gather near the base (rendezvous), then are divided into security and maneuvering agents (task allocation). After traveling from the base to the vicinity of the target building (go-to-goal), security agents protect the building’s perimeter (cyclic pursuit), until the end of the RESCUE phase. Meanwhile, the maneuvering agents locate the building’s entrance, by following its perimeter (perimeter patrol). Once the entrance has been found, the maneuvering agents gather at the entrance (rendezvous) and create a formation (formation control) before entering.

RESCUE

The maneuvering agents enter the building in formation (formation control) and cover the interior area (area coverage). Once the location of the subject to rescue is identified, the robots form a circular closure around the subject (cyclic pursuit). Then, the robots transport the subject to the safety zone, while maintaining the circular closure around the subject (containment control).

FOLLOW-THROUGH

Maneuvering agents spread (scatter) over the interior of the building. To signify that the area has been cleared, few robots are left inside the building as beacons (persistent coverage). The rest of the maneuvering agents and the security agents reunite outside the building (rendezvous). At last, they return to the base (leader-follower).

VII-C Results

We tested the behavior composition framework described in Section VI on the Securing a Building mission, which was executed on the Robotarium [49]. In Fig. 9, we display selected snapshots of the mission obtained by a camera mounted on the ceiling. In the experiment, $8$ differential-drive robots, indexed $1,\dots,8$ are deployed in a simulated urban environment composed of $6$ buildings, blue polygons indexed $1,\dots,6$ . In this experiment, we simulate a maximum sensor range $\Delta=0.5$ m. Because of the different spatial scales between FIND/ISOLATE phases and RESCUE/FOLLOW-THROUGH phases, the entire mission is divided in two parts. In the first part (Fig. 9(a) to Fig. 9(d)) the experiment is performed at a neighborhood-level scale. The remaining two phases are executed in a zoomed-in environment, which focuses on the one building of interest (Fig. 9(d) to Fig. 9(f)).

During FIND phase (Fig. 9(a) and 9(b)), two groups of robots $\text{\sc team}1:\{1,2,3,4\}$ and $\text{\sc team}2:\{5,6,7,8\}$ investigates preassigned lists of buildings, leaving some agents near the base station (the purple filled dot in the top right corner) if destination building cannot be reached without breaking the connectivity constraints. The red polygon in Fig. 9(b) and 9(c) is the target building after being identified by $\text{\sc team}1$ . During the ISOLATE phase (Fig. 9(c)), maneuvering agents look for the entrance, while the security agents secure the outer perimeter.

During the RESCUE phase (Fig. 9(d) to 9(e)), the agents inside the building, i.e. $\text{\sc team}1$ , localize the target (red dot) using Voronoi coverage (Fig. 9(d)) and escort it to the safe area (red circle) as shown in (Fig. 9(e)). Finally, robots $1$ and $2$ are left as beacon inside the building, while remaining robots return to the base (Fig. 9(f)).

VIII Conclusion

Sequential execution of multi-robot coordinated behaviors can be employed to solve real-world complex missions. However, sequences of behaviors can be executed only if the robots meet all required communication constraints in finite time. In this paper, we described a distributed framework for the sequential composition of coordinated behaviors designed on finite-time convergence control barrier functions. The resulting composition framework is formulated in the form of a quadratic program, which is solved locally by individual robots. Although the focus of this paper is on coordinated motion, the application of the proposed framework is relevant to other form of autonomous collaborations where the robots need to satisfy prescribed pair-wise proximity requirements that change over time. Finally, a large-scale multi-task scenario, denoted “Securing a Building” mission is proposed as an ideal environment for testing multi-robot techniques.

Appendix A Securing a Building as Benchmark Scenario

Testing the performance of techniques and algorithms for the control of multi-robot systems in real-world scenarios is a challenging task. This is particularly true when addressing novel approaches, as the focus on specific aspects of the problem might obscure all-around performance assessments. To this end, thanks to its modularity, the Securing a Building mission is an ideal testing framework. In this section, we suggest a number of selected research topics, for which this mission could serve as a testing framework when aiming to evaluate performance of new techniques. This appendix is by no mean proposed as a complete list of subjects relevant to multi-robot systems but rather as a discussion to stimulate application of the Securing a Building as a versatile, real-world testing scenario.

Team Assembly

Considerable efforts have been devoted to the development of team composition techniques for heterogeneous robots [51], [52]. Based on the skill set required to solve a particular task, e.g., certain actuation, sensing, locomotion, or communication capabilities, the question is to find a recruitment rule that produces a team capable of delivering the best performance. For instance, in the RESCUE phase, robots capable of opening doors may be required for the maneuvering agents, while agility and communication capabilities might be preferred during the FIND phase.

Communication

In the context of autonomous networked systems, central roles are played by the flow of information between agents, and the infrastructure required for it [53]. A number of questions can be posed in relation to the distribution of agents over a domain, given the constraints of communication systems, such as limited range, power requirements, and privacy of the information.

Unknown Environment

The amount of prior knowledge about the environment plays an important role in the definition of both low-level robot controllers and high-level mission plans. The performance of distributed solutions to the localization and mapping problems [54] can be tested on the Securing a Building. Aspect of interest include balancing between exploitation and exploration of the environment applied, for instance, to the building exploration planning.

Resilience

Failure of the mission can be attributed to factors such as damaged components, sensing errors, communication dropouts, delays, control disturbances, reduction of functionalities due to adversarial attacks, etc. A number of different research thrusts focus on the problem of detecting and responding to faults and malicious attacks in multi-agent and cyber-physical systems [55, 56, 57].

Bibliography57

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] E. Ackerman, “Flying lampshadebots come alive in cirque du soleil performance,” i EEE Spectrum , 2014.
2[2] X. Du, C. E. Luis, M. Vukosavljev, and A. P. Schoellig, “Fast and in sync: Periodic swarm patterns for quadrotors,” ar Xiv preprint ar Xiv:1810.03572 , 2018.
3[3] M. Santos, Y. Diaz-Mercado, and M. Egerstedt, “Coverage control for multirobot teams with heterogeneous sensing capabilities,” IEEE Robotics and Automation Letters , vol. 3, no. 2, pp. 919–925, 2018.
4[4] D. Shishika and V. Kumar, “Local-game decomposition for multiplayer perimeter-defense problem,” in 2018 IEEE Conference on Decision and Control (CDC) . IEEE, 2018, pp. 2093–2100.
5[5] H. Han and R. G. Sanfelice, “A hybrid control algorithm for object grasping using multiple agents,” in 2018 IEEE Conference on Control Technology and Applications (CCTA) . IEEE, 2018, pp. 652–657.
6[6] J. Suarez and R. Murphy, “A survey of animal foraging for directed, persistent search by rescue robotics,” in 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics . IEEE, 2011, pp. 314–320.
7[7] D. Zelazo, M. Mesbahi, and M.-A. Belabbas, “Graph theory in systems and controls,” in 2018 IEEE Conference on Decision and Control (CDC) . IEEE, 2018, pp. 6168–6179.
8[8] J. Cortés and M. Egerstedt, “Coordinated control of multi-robot systems: A survey,” SICE Journal of Control, Measurement, and System Integration , vol. 10, no. 6, pp. 495–503, 2017.