On Steering Swarms

Ariel Barel; Rotem Manor; Alfred M. Bruckstein

arXiv:1902.00385·cs.MA·February 4, 2019

On Steering Swarms

Ariel Barel, Rotem Manor, Alfred M. Bruckstein

PDF

Open Access

TL;DR

This paper introduces a new method for externally controlling swarms of identical agents using simple broadcast signals based on the average location, without needing individual agent communication or location data.

Contribution

It presents a novel approach enabling external steering of indistinguishable agents through global signals derived from the swarm's average position.

Findings

01

Effective external control of swarms demonstrated

02

No individual agent communication required

03

Works despite agents lacking absolute location information

Abstract

The main contribution of this paper is a novel method allowing an external observer/controller to steer and guide swarms of identical and indistinguishable agents, in spite of the agents' lack of information on absolute location and orientation. Importantly, this is done via simple global broadcast signals, based on the observed average swarm location, with no need to send control signals to any specific agent in the swarm.

Figures5

Click any figure to enlarge with its caption.

Equations20

\begin{split}p(k+1)=p(k)+c(k)\tilde{\Delta}(k)\\ c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}(k)^{T}d<0\\ 1&\quad\quad o.w.\end{array}\right.\end{split}

\begin{split}p(k+1)=p(k)+c(k)\tilde{\Delta}(k)\\ c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}(k)^{T}d<0\\ 1&\quad\quad o.w.\end{array}\right.\end{split}

E {Δ x} = 0.5 E {Δ x ∣ \tilde{Δ} x \geq 0} + 0.5 E {Δ x ∣ \tilde{Δ} x < 0} = 0.5 (1 - μ) E (\tilde{Δ} x ∣ \tilde{Δ} x \geq 0)

E {Δ x} = 0.5 E {Δ x ∣ \tilde{Δ} x \geq 0} + 0.5 E {Δ x ∣ \tilde{Δ} x < 0} = 0.5 (1 - μ) E (\tilde{Δ} x ∣ \tilde{Δ} x \geq 0)

E {∥ p (k + 1) ∥^{2} ∣ p (k)} = p (k)^{2} - A (\frac{1 - μ}{2}) ∥ p (k) ∥ + B (1 + μ^{2})

E {∥ p (k + 1) ∥^{2} ∣ p (k)} = p (k)^{2} - A (\frac{1 - μ}{2}) ∥ p (k) ∥ + B (1 + μ^{2})

E {∥ p (k + 1) ∥^{2}} = E {∥ p (k) ∥^{2}} - (A (\frac{1 - μ}{2}) E {∥ p (k) ∥} - B (1 + μ^{2}))

E {∥ p (k + 1) ∥^{2}} = E {∥ p (k) ∥^{2}} - (A (\frac{1 - μ}{2}) E {∥ p (k) ∥} - B (1 + μ^{2}))

k (δ) = \frac{D ^{2} ( 0 ) - ( \frac{B ( 1 + μ ^{2} ) + δ}{A ( \frac{1 - μ}{2} )} ) ^{2}}{δ}

k (δ) = \frac{D ^{2} ( 0 ) - ( \frac{B ( 1 + μ ^{2} ) + δ}{A ( \frac{1 - μ}{2} )} ) ^{2}}{δ}

p_{i} (k + 1) = p_{i} (k) - σ j = 1 \sum n (p_{i} (k) - p_{j} (k)) + \tilde{Δ}_{i} (k)

p_{i} (k + 1) = p_{i} (k) - σ j = 1 \sum n (p_{i} (k) - p_{j} (k)) + \tilde{Δ}_{i} (k)

\displaystyle\begin{split}&p_{i}(k+1)=p_{i}(k)+c(k)[-\sigma\sum_{j=1}^{n}(p_{i}(k)-p_{j}(k))+\tilde{\Delta}_{i}(k)]\\ &c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}_{cm}(k)^{T}d<0\\ 1&\quad\quad o.w.\end{array}\right.\end{split}

\displaystyle\begin{split}&p_{i}(k+1)=p_{i}(k)+c(k)[-\sigma\sum_{j=1}^{n}(p_{i}(k)-p_{j}(k))+\tilde{\Delta}_{i}(k)]\\ &c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}_{cm}(k)^{T}d<0\\ 1&\quad\quad o.w.\end{array}\right.\end{split}

\displaystyle\begin{split}&p_{i}(k+1)=\left\{\begin{array}[]{ll}p_{i}(k)&\quad\psi_{i}(k)\geq\pi\mbox{ or }\chi_{i}(k)=0\\ p_{i}(k)+c(k)\tilde{\Delta}_{i}(k)&\quad o.w.\\ \end{array}\right.\\ &\chi_{i}(k)=\left\{\begin{array}[]{ll}1&\quad\quad\mbox{w.p. }\delta\\ 0&\quad\quad\mbox{w.p. }1-\delta\end{array}\right.\\ &c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}_{cm}(k)^{T}d<0\\ 1&\quad\quad o.w.\\ \end{array}\right.\\ &\tilde{\Delta}_{i}(k)=\mbox{vector from $p_{i}(k)$ to a random point in $ar_{i}(k)$}\\ \end{split}

\displaystyle\begin{split}&p_{i}(k+1)=\left\{\begin{array}[]{ll}p_{i}(k)&\quad\psi_{i}(k)\geq\pi\mbox{ or }\chi_{i}(k)=0\\ p_{i}(k)+c(k)\tilde{\Delta}_{i}(k)&\quad o.w.\\ \end{array}\right.\\ &\chi_{i}(k)=\left\{\begin{array}[]{ll}1&\quad\quad\mbox{w.p. }\delta\\ 0&\quad\quad\mbox{w.p. }1-\delta\end{array}\right.\\ &c(k)=\left\{\begin{array}[]{ll}\mu&\quad\quad\tilde{\Delta}_{cm}(k)^{T}d<0\\ 1&\quad\quad o.w.\\ \end{array}\right.\\ &\tilde{\Delta}_{i}(k)=\mbox{vector from $p_{i}(k)$ to a random point in $ar_{i}(k)$}\\ \end{split}

E {Δ x_{c m}} \geq 0.25 (1 - μ) \frac{1}{n ^{2}} V a r^{*}

E {Δ x_{c m}} \geq 0.25 (1 - μ) \frac{1}{n ^{2}} V a r^{*}

V a r^{*} = δ^{2} (\frac{σ}{2})^{2} \frac{1 - cos ^{4} ( \frac{π - ψ _{*}}{2} )}{\frac{π - ψ _{*}}{2} - \frac{1}{2} sin ( π - ψ _{*} )}

V a r^{*} = δ^{2} (\frac{σ}{2})^{2} \frac{1 - cos ^{4} ( \frac{π - ψ _{*}}{2} )}{\frac{π - ψ _{*}}{2} - \frac{1}{2} sin ( π - ψ _{*} )}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems

Full text

11institutetext: Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel

11email: [email protected]

On Steering Swarms

Ariel Barel

0000-0003-3275-4264

Rotem Manor

0000-0002-2504-1509

Alfred M. Bruckstein

Abstract

The main contribution of this paper is a novel method allowing an external observer/controller to steer and guide swarms of identical and indistinguishable agents, in spite of the agents’ lack of information on absolute location and orientation. Importantly, this is done via simple global broadcast signals, based on the observed average swarm location, with no need to send control signals to any specific agent in the swarm.

1 Introduction

This paper deals with steering multi-agent systems, based on decentralized gathering laws, using an external broadcast control signal. Agents move according to local information provided by their sensors. The agents are assumed to be identical and indistinguishable, memoryless (oblivious), with no explicit communication between them. The agents do not share a common frame of reference i.e. agents are not equipped with either GPS systems or compasses. By assumption, agents sense the distance and/or bearing to their neighbours, within a finite or infinite range of visibility. An external observer/controller continuously monitors the swarm’s location and broadcasts the same control signal, based on the centroid of the agents’ constellation. We present a simple yet practical method to steer the swarm and guide it to a given destination.

Note that unlike the simple agents that are anonymous, unaware of their position, lack memory, and do not use explicit communication to maintain the swarm cohesion, the external controller does need the ability to continuously monitor the trajectory of the swarm location. Due to these capabilities, the controller is able to influence the movement of the swarm, with a very simple global control signal broadcast simultaneously to all agents.

The inspiration to this control method came from the following observation: some of the gathering algorithms, while they ensure the convergence of agents to a bounded area, do not imply that the centroid of the agents’ location remains stationary in the plane [1, 12, 16, 13, 17, 9, 6, 14, 4]. In fact, some gathering algorithms exhibits random walk like behaviour of the centroid of the agents’ constellation after gathering as discussed in [3]. The method to steer the swarm to a target point, presented herein, exploits the movements of the system’s center of gravity due to the agents’ compliance with the distributed convergence algorithm.

2 How to Control a Single Agent

We first describe the basic idea in conjunction with a single agent performing a random walk in the plane, and then extend the discussion to multi-agent systems carrying out various cohesion ensuring gathering algorithms. Assume a drunkard agent is moving in the plane in the following random way: at discrete times $k=1,2,3,...$ he selects a new destination for time $k+1$ . The destination location $\tilde{p}(k+1)$ is randomly and homogeneously distributed in a unit disc centered at its current position $p(k)$ , so that $\tilde{p}(k+1)=p(k)+\tilde{\Delta}(k)$ , where $\tilde{\Delta}(k)$ is a random vector uniformly distributed in a unit disc. After selecting $\tilde{p}(k+1)$ the agent starts going there from $p(k)$ in a straight path. By monitoring his motion, one can steer him in any direction with the following control rule: if the projection of his current movement on the required direction is positive - allow the drunkard to finish his step. Otherwise, stop him after a fraction of the unit interval $\mu<1$ , by broadcasting (shouting) a startling “stop!” signal.

This process will cause the drunkard to perform a biased walk, making, in expectation, bigger steps in the desired direction. To bring the drunkard toward a region near a precise target point in the plane, one may define the desired direction to always point from the current location of the drunkard to the goal. Assume first, for simplicity, that the desired direction is fixed. Let $p(k)$ be the current position of the agent and let $d\in\mathbb{R}^{2}$ be a unit vector in the direction in which we require the agent to move. Denote by $\tilde{\Delta}(k)$ the planned travel vector of the agent for the current time period $[k,k+1)$ , from $p(k)$ , its position at time $k$ , to a homogeneously distributed random point in a unit disc centered at $p(k)$ , and by $\Delta(k)$ its actual travel vector. The relation between $\tilde{\Delta}(k)$ and $\Delta(k)$ is as follows : at time $k$ the agent starts traveling from its existing position $p(k)$ to its planned position $\tilde{p}(k+1)=p(k)+\tilde{\Delta}(k)$ in a piecewise constant velocity equal to $\tilde{\Delta}(k)/1$ . If $\tilde{\Delta}(k)^{T}d\leq 0$ , the external controller stops the agent at a fraction $\mu$ of the time-step, i.e. $\Delta(k)=\mu\tilde{\Delta}(k)$ , otherwise the controller does not interrupt its motion during the current time period, hence $\Delta(k)=\tilde{\Delta}(k)$ . Therefore we have

[TABLE]

where $\tilde{\Delta}(k)$ is a vector from $p(k)$ to the homogeneously distributed random point in a unit disc centered at $p(k)$ . By symmetry of the random distribution function, for any direction $x$ , we have that the expectation of a planned step is $\mathbf{E}\{\tilde{\Delta}x(k)\}=0$ . The required direction of movement $d$ is, without loss of generality, towards the positive $x$ axis, i.e. to the right. Clearly, by the symmetry of the distribution function, we have that the probabilities that the drunkard moves right and left are same and equal $0.5$ . Hence, the expected actual travel of the agent, given external controller’s (possible) interruptions, is (omitting the time index $(k)$ for simplicity):

[TABLE]

In order to guide an agent to a target point, the controller can set the required direction at each time-step, from the current position of the agent to the target point. Let us find the expected position of the agent at time $(k+1)$ given $p(k)$ , i.e. $\mathbf{E}\{\|p(k+1)\|^{2}\mid p(k)\}$ . By the law of cosines in a triangle [5] we obtain that

[TABLE]

where $A=\mathbf{E}\left\{\frac{\tilde{\Delta}(k)^{T}p(k)}{\|p(k)\|}\operatorname{sgn}\left\{\frac{\tilde{\Delta}(k)^{T}p(k)}{\|p(k)\|}\right\}\right\}$ is positive and depends only on the direction vector $d(k)=\frac{p(k)}{\|p(k)\|}$ , and for a rotationally symmetric $\tilde{\Delta}(k)$ it is independent of $d(k)$ (and on $p(k)$ of course), and $B=\mathbf{E}\{\|\tilde{\Delta}(k)\|^{2}\}$ is positive and obviously independent on $p(k)$ . From this result it follows that

[TABLE]

We have that if the right expression in big parentheses in (4) is bigger than $\delta$ , $\mathbf{E}\{\|p(k)\|^{2}\}$ decreases by $\delta$ , and while this inequality persists, it will decrease until $\mathbf{E}\{\|p(k)\|\}\leq\left(\frac{B(1+\mu^{2})+\delta}{A(\frac{1-\mu}{2})}\right)$ . Returning to (4) we have that after $k(\delta)$ steps, given by

[TABLE]

the process will necessarily stop and the agent will be “near” the target. Simulated results of $k$ vs. $\delta$ for some different initial values of $D(0)$ and the graph of Equation (5) plotted in Figure 1 shows that the theoretical $k(\delta)$ is indeed a rather loose upper bound on the number of steps needed to reach the target’s neigbourhood.

3 Controlling Multi-Agent Systems - the Idea

Let us adopt this steering method to a multi-agent system. Suppose there is a multi-agent system which converges to a bounded area. The lack of a global orientation of the agents prevents the viewer from simply broadcasting the desired direction of movement as suggested by Azuma et. al. [2] and others, since the agents are unable to obey global-direction-based commands. Research methods that draw inspiration from animal behaviour in herds in nature e.g. [7] are based on the fact that part of the group moves in a certain direction and indirectly influences the group’s behaviour, but in this article we assume that even leaders do not know how to orient themselves and find the desired direction of movement. Additionally, recall that our agents are anonymous and indistinguishable, hence an external observer wishing to lead the system in a required direction can not steer individual agents separately by transmitting control commands to each one of them. We show here that an external observer can lead a multi-agent system in a required direction (while the agents also converge to a bounded region), by only sensing the motion of the system’s centroid. This information represents for the external controller the location of the group, and it is feasible to measure or estimate in real life multi-agent scenarios, especially for large numbers of agents, such as swarms of drones. Let $p_{cm}(t)=\frac{1}{n}\sum_{j=1}^{n}p_{i}(t)$ be the system’s centroid. The velocity of the centroid is the average velocities of the agents $\dot{p}_{cm}(t)=\frac{1}{n}\sum_{j=1}^{n}\dot{p}_{i}(t)$ and we have that while all agent velocities are constant the centroid velocity is constant as well. We assume that during each time interval $k=1,2,3,...$ each agent’s velocity is constant, therefore we have that $\hat{\dot{p}}_{cm}(t)$ , the direction of the centroid movement is piecewise constant (i.e. does not change during time intervals hence moves in straight lines). Similar to our discussion in section 2, here, the external controller tracks the motion of the centroid of the system. If the projection of its movement is on the required direction ( $\tilde{\Delta}_{cm}(k)^{T}d\geq 0$ ) - it allows all the agents to finish their planned travels. Otherwise, it stops them all after a fraction $\mu$ of the time-step, i.e. when they complete a fraction $\mu$ their planned travel. We discuss in detail different types of such systems, and bound the expected “velocity” of the swarm’s centroid due to this control mechanism.

3.1 Steering a System of Agents with Infinite Visibility and Full Sensing

We begin with a simple linear multi-agent gathering process in discrete time for the infinite visibility and full sensing case. Each agent $i$ moves according to the decentralized dynamic law: $p_{i}(k+1)=p_{i}(k)-\sigma\sum_{j=1}^{n}(p_{i}(k)-p_{j}(k))$ , where $0<\sigma<\frac{2}{n}$ is a constant gain factor, i.e. at each time-step, each agent jumps proportionally to the sum of relative position vectors to all the other agents (recall system $\mathcal{S}_{2}$ , in [3]). As proved by Gazi, Passino et. al. [8], since the dynamics of such system is governed by an antisymmetric pairwise interaction function, the average position of the agents is invariant. To steer this system in some desired direction, we would like to bias the motion of the system centroid by measuring its trend, hence we assume some additive “noise” that breaks symmetry and causes the center of the system to move. We hence assume that each agent, in addition to obeying the distributed control law above, also moves to a randomly selected point at each time step:

[TABLE]

where $\tilde{\Delta}_{i}(k)$ is a randomly selected point in a unit disc. Here too, at time $k$ the agents start traveling from their existing positions $p_{i}(k)$ towards their next planned positions $\tilde{p}_{i}(k+1)$ in piecewise constant velocities equal to their distance from it $[-\sigma\sum_{j=1}^{n}(p_{i}(k)-p_{j}(k))+\tilde{\Delta}_{i}(k)]/1$ , so that if an external controller does not intervene, all the agents arrive at their destinations simultaneously at time $k+1$ . Hence we may denote the planned motion of the centroid to be $\tilde{\Delta}_{cm}(k)=\bar{\tilde{p}}(k+1)-\bar{p}(k)=\frac{1}{n}\sum_{i=1}^{n}\tilde{\Delta}_{i}(k)$ , and the control mechanism for system (6) is:

[TABLE]

Here $c(k)$ represents the optional “stop” signal received simultaneously at fraction $\mu$ of the time-step by all agents, $\tilde{\Delta}_{cm}(k)=\frac{1}{n}\sum_{i=1}^{n}\tilde{\Delta}_{i}(k)$ is the planned travel of the centroid of the agents, and $d$ is the required direction of movement of the system. Since the projection on $x$ of the second moment of a disc of radius $r$ is $\frac{1}{4}\pi r^{4}$ , we have in this system [5] that $\mathbf{E}\{\Delta x_{cm}\}\geq 0.5(1-\mu)\frac{1}{8n}$ i.e. the bound on the expected step of the centroid is inversly proportional to the number of agents. To guide a system to a goal point, the observer controller should set the desired direction at every time interval so $d(k)$ is a unit vector from the centroid of the system to the goal point. Figure 2 presents a typical simulation result of this system with full visibility and complete sensing, with some evenly distributed noise jump to a unit disc of each agent, as presented in equation (7).

3.2 Steering a System of Agents with Limited Visibility and Bearing Only Sensing

Here we assume that the agents are able to sense the direction to their neighbours (i.e. bearing only sensing), and their motions being determined by the set of unit vectors pointing from their current location to their neighbours. The neighbours are defined for each agent $i$ at time-step $k$ as the set of agents located within a given visibility range $V$ form its position $p_{i}(k)$ . Manor et. al. [15] modified Gordon’s et. al. motion laws [10] [11], and proved that the new law gathers the agents of the system to a disc with a radius equal to the agents’ maximal step size $\sigma$ within a finite expected number of time steps, and that the distribution of the agents’ average position converges in probability to the distribution of a random-walk. As in section 3.1, we assume here piecewise continuous dynamics (where agents continuously move towards their new locations), so that the formal steering algorithm for this system is:

[TABLE]

where $\tilde{\Delta}_{cm}(k)=\sum_{i=1}^{n}\tilde{\Delta}_{i}(k)$ is the planned jump of the centroid of the system, and $d$ is a unit vector in the required moving direction of the system. It was proved in [15] that the original model, given no external control, satisfies $\mathbf{E}\{\Delta_{cm}(k)\}=0$ , and that

[TABLE]

Figure 2 presents simulations result of this system (8). The system gathers and moves to a goal, and the trace of the travel of the system’s centroid is plotted.

4 Conclusions

A method has been introduced here that allows an external observer to control a multi-agent system and guide it to a desired destination even when the agents are very primitive. According to our paradigm all the agents are identical (anonymous), therefore the external observer can not send a separate command to each agent, but can broadcast the same command to all the agents. The viewer controls the swarm by means of an identical command sent simultaneously to all agents. The method was tested for different cases: the control of a single moving agent performing random-walk, steering of a system with infinite visibility and relative distance and bearing measurement, and control of a system with partial information (limited visibility and bearing only measurement).

Acknowledgments.

This research was partly supported by Technion Autonomous Systems Program (TASP).

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Ando, H., Oasa, Y., Suzuki, I., Yamashita, M.: Distributed memoryless point convergence algorithm for mobile robots with limited visibility. Robotics and Automation, IEEE Transactions on 15 (5), 818–828 (1999)
2[2] Azuma, S.i., Yoshimura, R., Sugie, T.: Broadcast control of multi-agent systems. Automatica 49 (8), 2307–2316 (2013)
3[3] Barel, A., Manor, R., Bruckstein, A.M.: Come together: Multi-agent geometric consensus (gathering, rendezvous, clustering, aggregation). Tech. rep., CIS Technical Report, TASP (2016)
4[4] Barel, A., Manor, R., Bruckstein, A.M.: Probabilistic gathering of agents with simple sensors. Tech. rep., CIS Technical Report, TASP (2017)
5[5] Barel, A., Manor, R., Bruckstein, A.M.: On steering swarms. Tech. rep., CIS Technical Report, TASP (2018)
6[6] Bellaiche, L.I., Bruckstein, A.M.: Continuous time gathering of agents with limited visibility and bearing-only sensing. Tech. rep., CIS Technical Report, TASP (2015)
7[7] Couzin, I.D., Krause, J., Franks, N.R., Levin, S.A.: Effective leadership and decision-making in animal groups on the move. Nature 433 (7025), 513 (2005)
8[8] Gazi, V., Passino, K.M.: Stability analysis of social foraging swarms. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on 34 (1), 539–557 (2004)