TL;DR
This paper investigates a probabilistic gathering algorithm for anonymous, oblivious agents with simple sensors, demonstrating finite-time convergence through empirical and theoretical analysis under different motion models.
Contribution
It introduces a novel probabilistic gathering method for agents with minimal sensing and no communication, providing empirical and theoretical insights into convergence behavior.
Findings
Clustering occurs in finite expected time with high probability.
Continuous sensing and motion can guarantee gathering with a blind-zone.
Convergence time depends on the number of agents and blind-zone size.
Abstract
Gathering is a fundamental task for multi-agent systems and the problem has been studied under various assumptions on the sensing capabilities of mobile agents. This paper addresses the problem for a group of agents that are identical and indistinguishable, oblivious, and lack the capacity of direct communication. At the beginning of unit-time intervals, the agents select random headings in the plane and then detect the presence of other agents behind them. Then they move forward only if no agents are detected in their sensing "back half-plane". Two types of motion are considered: when no peers are detected behind them, either the agents perform unit jumps forward, or they start to move with unit speed while continuously sensing their back half-plane, and stop whenever another agent appears there. For the first type of motion extensive empirical evidence suggests that with high…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\newsiamremark
remarkRemark \newsiamremarkhypothesisHypothesis
\newsiamthmclaimClaim \headersProb. Gathering with Simple SensorsA. Barel, T. Dagès, R. Manor, and A. M. Bruckstein
\externaldocumentex_supplement
Probabilistic Gathering of Agents with Simple Sensors††thanks: Submitted to the editors April 12, 2020.
Ariel Barel Department of Computer Science, Technion - Israel Institute of Technology, Haifa, ISRAEL (, http://www.cs.technion.ac.il/people/arielba/). [email protected]
Thomas Dagès22footnotemark: 2
Rotem Manor22footnotemark: 2
Alfred M. Bruckstein22footnotemark: 2
Abstract
Gathering is a fundamental task for multi-agent systems and the problem has been studied under various assumptions on the sensing capabilities of mobile agents. This paper addresses the problem for a group of agents that are identical and indistinguishable, oblivious, and lack the capacity of direct communication. At the beginning of unit-time intervals, the agents select random headings in the plane and then detect the presence of other agents behind them. Then they move forward only if no agents are detected in their sensing “back half-plane”. Two types of motion are considered: when no peers are detected behind them, either the agents perform unit jumps forward, or they start to move with unit speed while continuously sensing their back half-plane, and stop whenever another agent appears there. For the first type of motion extensive empirical evidence suggests that with high probability clustering occurs in finite expected time to a small region with diameter of about the size of the unit jump, while for continuous sensing and motion we can prove gathering in finite expected time if a “blind-zone” is assumed in their sensing half-plane. Relationships between the number of agents or the size of the blind-zone and convergence time are empirically studied and compared to a theoretical upper-bound dependent on these factors.
keywords:
Control, Decentralized, Gathering, Multi-Agent, Simple Sensors
{AMS}
93A14, 93C10, 93E15
1 Introduction
This paper deals with gathering of multi-agent systems, based on a simple decentralized control law. Agents move according to local information provided by their sensors. The agents are assumed to be identical and indistinguishable, memoryless (oblivious), with no explicit communication between them. They do not have a common frame of reference (i.e. agents are not equipped with GPS sensors or compasses). This paper is an extended and improved version of a preprint that appeared on arXiv in 2018 [4].
A wealth of gathering algorithms were described and analyzed in the multi-agent robotics literature. They differ in the assumptions made on the sensing that is performed by the agents, in the assumptions on the possible moves that can be made and the computational requirements for the decision process that leads to the response [6, 16, 1, 11, 7, 9, 10, 14, 12, 15, 8, 5, 13].
A recent report [3] surveyed in detail gathering or clustering algorithms under the assumptions of limited vs. unlimited visibility sensing, complete relative position sensing vs. bearing only information provided by the sensors, and discrete vs. continuous motion schedules for the agents. We here present a novel gathering, or geometric consensus algorithm based on a simple randomized rule of motion for agents with headings that can only sense the presence or absence of other agents in a half-plane behind them. We assume that the agents have intrinsic (but not global) orientation and they can move forward, but each agent can, at various instances, select a new heading at random. This is accomplished by doing an independent and uniformly distributed turn over from their current orientation.
Under these assumptions, we consider two different gathering algorithms. The first one assumes that agents make a forward jump of size whenever there are no agents behind them, i.e. in their back half-plane. The agents act synchronously and new headings are selected at random and independently at each unit-time, then their “back half-plane” sensor’s reading tells the agents whether to jump forward or stay put. Extensive experimental results with this process shows that probabilistic gathering to a small region occurs in time proportional to the number of agents. We cannot prove this result yet.
The second gathering algorithm we discuss is a continuous version of this process. Here we assume that the sensing is continuously done, and the forward motion of agents during each unit time-interval is continuously conditioned on the absence of agents in the sensing area behind them. As we shall see, in this case we also need to assume a blind-zone for the backwards sensing, in order to avoid dead-lock situations preventing gathering. Under these new assumptions we can prove gathering in finite expected time, and obtain a bound on the gathering time dependent on the number of agents, the initial spread of the group, and the size of the blind-zone in sensing.
Extensive experiments are performed for both gathering algorithms. Results not only confirm our theory but also provide further empirical insight into the behaviour of the swarms. The code has been made public111https://github.com/Tommoo/SwarmGatheringHalfPlanes.
2 The discrete algorithm
Consider a system of identical, anonymous, and memoryless agents specified by their time varying locations in the euclidean plane and heading vectors which are unit vectors randomly selected on the unit circle. These quantities are unknown to the agents themselves as they lack global position and orientation information. We define “heading” as the direction where the agent’s nose is pointing, i.e. its current direction of (possible) motion. The agents implicitly interact with each other in such a way that an agent’s next position after one time unit is determined by the constellation of all the agents in the system.
2.1 Sensing
Each agent is equipped with an onboard sensing device, aimed in the opposite direction to the agent’s heading , covering the back half-plane (with field of view). We may call this sensing device a “Backward Looking Binary Sensor”. If there is no other agent in the field of view of agent , the output signal is , else .
2.2 Timing and the Motion Law
At time the agents are in an arbitrary initial constellation with randomly selected headings, and perform forward jumps if their sensor reading is . Then at each time-step every agent changes its heading direction by choosing a uniformly distributed random turn from resulting in a new random heading , and then, if its back closed half-plane is empty, it jumps forward a fixed step-size . Otherwise it stays put until the next time-step.
The dynamic motion law is formally described as follows. Denote the locations of the agents at time-step by . Then:
[TABLE]
is the binary output from the sensor as seen in Fig. 1, and \hat{\theta}_{i}(k)=\bigl{[}\begin{smallmatrix}\cos({\theta_{i}(k)})\\ \sin({\theta_{i}(k)})\end{smallmatrix}\bigr{]} is the agent’s random heading.
2.3 Simulation Results and System Behavior
Typical simulation results of gathering are shown in Fig. 2. In all the simulations we ran, a number of agents are randomly placed in the plane, and in all cases the system converges to an area within a circular region of radius less than , whose “center” wanders at random in the plane. Empirically, once we have reached convergence, this radius seems to randomly fluctuate around the value with variance decreasing with the number of agents. Furthermore, once convergence has been achieved, any departure from the unit disk implies a radius lower than due to the rules of motion, and empirically it decreases again to go back to convergence within a disk of radius at most . This phenomenon seldom happens and only on systems with very few agents, typically less than .
We found that, for estimating the expected time of convergence, a choice of runs provides a reliable estimator. A detailed analysis of this choice is presented as supplementary material in Section 1.1. Figure 3 summarizes simulation runs with different number of agents, spread uniformly over the same initial area. Notice that the effect of the number of agents on the convergence time of the system is linear, and we found a slope of approximately unit time-steps per number of agents.
As shown in Fig. 4, only the agents occupying the corners of the convex-hull of the constellation may select an orientation with empty back half-planes, thus only they may jump, while the inner agents necessarily stay put until they become external. Since agents’ headings change randomly, agents on the convex-hull of the system have a high probability to jump towards all other agents, hence the system tends to gather, as shown in the simulations.
But this simplified account does not take into consideration the fact that even divergence may occur due to blind unit jumps, however with negligibly small probability. An adversarial argument clearly illustrates this. Indeed, consider a system of two agents, , and assume that the agents’ headings are (almost) perpendicular to the line they define and oriented in opposite directions (so that we have and ). Since both their back half-planes are empty they both jump forward, and obviously we have that the distance between them increases as a result of the move.
We next suggest a modified gathering algorithm with continuous-time dynamics, for which we can prove gathering to a very small region in finite expected time.
3 Continuous-time dynamics
In this system, once in a unit time-interval , simultaneously, each agent changes its heading direction by choosing a uniformly distributed random angle between [math] and . Here too, the sensor is aimed backwards (at ) but it includes a “blind-zone” half disc area of radius , typically (see Fig. 5). During the time-interval an agent keeps its heading direction, and if its sensing area is empty it moves forward with a fixed velocity , otherwise it stops. Note that we assume that the agent may move and stop during according to the changing constellation of the system.
Denote by the distance between agents and at time , so that if , the agents are too close to potentially see each other, otherwise we call them “separated”. The dynamic law in continuous-time is:
[TABLE]
For the agents’ dynamics, by abuse of notation, derivatives actually represent well-defined right derivatives. In the following proofs we often omit the time index .
Lemma 3.1**.**
*The distance between two “separated” agents never increases. *
Proof 3.2**.**
Suppose agents and are “separated” at time so that . Denote by the (current) small angle between vector and the heading direction of agent as shown in Fig. 5.
The derivative of the distance between and (that is the inverse of their approach speed) is given by
[TABLE]
By the dynamic law, if the sensor coverage area of agent is not empty (i.e. ) it does not move. In this case its speed is . Otherwise, all other agents including agent are either in front of it or in its “blind-zone” and then it moves forward at . However, since we assumed agents and to be “separated”, agent is then in this case necessarily in front of agent , thus , i.e. . Similar arguments hold for agent , that is either or and . Hence we have that
[TABLE]
Corollary 3.3**.**
*Agents within range at time remain within range at all . *
Proof 3.4**.**
*This is a direct consequence of Lemmas 3.1 and 4. Note that if an agent is closer than to agent (so that ), in order for to increase above , it first has to reach the value since when an agent moves, it moves in a continuous motion, but by Lemma 3.1 the value cannot increase. A detailed proof for this corollary is given in Appendix A. *
Next we show that if not all agents are confined inside a circle of radius there is a strictly positive probability bounded away from zero for the distance between pairs of agents to decrease at a positive and bounded away from zero rate, until all agents are confined in a circle of radius .
3.1 Proof of Convergence
Theorem 3.5**.**
*Continuous dynamics with agents acting according to the motion law given in Eq. 2, converges to a region of radius in finite expected time. *
Proof 3.6**.**
We know by Corollary 3.3 that pairs of agents within range from each other at some time will remain so forever. We shall next show that while in the agents’ configuration there exists pairs of agents at distance bigger than from each other, there is a significant (bounded away from zero by a constant) probability that there will be a significant decrease in the distance between them.
3.1.1 Preliminaries
Lemma 3.7**.**
*As long as not all agents are confined in a circle of radius , there is an agent with a strictly positive probability bounded away from zero by a constant to move a distance bounded away from zero by another constant during . *
Proof 3.8**.**
The sum of corner angles of any convex polygon of corners is given by , therefore the sharpest corner of a convex polygon of corners is bounded from above by . Since the maximal number of corners of the convex-hull of a system of agents is , we have that , the sharpest corner of the convex-hull of a system on agents, is bounded by
[TABLE]
Let be the convex-hull angle at a corner , and let and be the locations of the corners of the convex-hull adjacent to (see Fig. 6).
Denote by the smaller angle between the two lines perpendicular to the sides of so that if the random heading of agent falls inside it can move. Note that if the heading of is precisely along one of the sides of (and not explicitly inside ), agent may not move (as then or may be located in its sensing area). Therefore consider the symmetric central half of about the bisector of (and too) denoted by . If the heading of agent on the convex hull falls inside , then agent is guaranteed to move at the beginning of the time-interval. The probability for this event to happen is . Hence the probability for agent (defining the convex-hull) to move is lower-bounded as follows:
[TABLE]
To further bound this probability independently of the constellation, let us consider the agent at the sharpest corner of the convex-hull, so that by Eq. 5 we know that . Hence the probability of , the agent at the sharpest corner of the convex-hull to move is lower-bounded by
[TABLE]
Let us define the case where the heading of agent is inside its associated (at the beginning of a time-interval) as a “successful time-interval”, and its associated bound Eq. 6 as “the minimal probability for a successful time-interval” at the beginning of . We shall next prove that for a “successful time-interval” agent moves a distance bounded away from zero by a constant.
To find a bound on the minimal distance that agent will surely travel if a successful heading was selected (with probability greater than ), we must compute the smallest distance that agent can move ahead unimpeded by any other agent entering its sensing area.
Assume the heading of an agent at the sharpest corner of the convex-hull is inside its associated as shown in Fig. 6. Agent (adjacent to on the convex-hull) and agent define a side of the convex-hull, therefore is located somewhere along this side of .
By assumption, the agents of the system reside in a convex region contained in the front half-plane of the moving agent . The geometry of the convex region may change in time as other agents can possibly move. However, due to the bounded speed of all agents, the region where all agents reside during some time after the motion of agent started will be contained within a dilation of the original convex-hull by a disc of radius .
Thus agent will keep advancing at least until its forward moving sensing region intersects the dilation of the convex-hull of the agents with the same speed (or does not sense any other agents during this time-interval). Since agents , , and are successive agents on the convex hull, the convex hull at the beginning of the time-step resides completely within the angle: the smaller region limited by the half-lines and . Therefore the dilation of the convex hull is included within the dilation of these half-lines. To lower bound the step made by agent , we study when the dilation of these half-lines first intersect the translation of the sensing region of agent .
The dilation of the half-lines consists in the union of three simple geometric objects: the translation of the two half-lines in the outward direction orthogonal to them, and a circular arc with fixed center and angle and with growing radius.
Assume the translation of the sensing region of agent first intersects the dilation of the half-lines on one of their translations. In this case, from simple considerations, the first intersection happens on the translation of the half-line with smallest angle with respect to the original half-plane of sensing of agent . Without loss of generality assume this happens on the side.
We then clearly see that the intersection between the sensing region of agent and the translating half-line will happen when this half-line intersects the corner point at distance of agent on the side. This happens at point which is located a distance of , where is (see Fig. 7). Since the heading of agent is assumed to be within the region, we have
[TABLE]
We have by Eq. 5 that , therefore
[TABLE]
And since , we have that , and therefore (as ) we have that
[TABLE]
This shows that agent , in this case, will move by at least a distance of
[TABLE]
before possibly being stopped by the motion law we defined.
Assume now that the translation of the sensing region of agent first intersects the dilation of the half-lines on the dilating open circular arc around .
Since the heading of agent is inside the angle, then agent moves “forward”, in the sense that if the velocity vector defines an axis, then the projection of the displacement of agent onto that axis is positive. Assume that the first intersection between the constant angle growing radius circular arc and the translated half-circle border of agent ’s sensing region is on a corner point of the border of the sensing region of at distance from it. As the normal to the circular arc of the sensing region points towards the current position of agent and the normal to the growing radius circular arc points towards the initial position of agent , we then have that the projection of the vector is negative onto the axis of displacement of , which is a contradiction. Furthermore, we also have that any intersection with the half-line borders of the sensing region of agent is clearly not the first possible one. Therefore the first intersection of the growing radius circular arc and of the translating border of sensing of agent necessarily happens in the interior of the half-circle border of the sensing region of agent .
We thus have that, in this case, the first intersection would happen in the interior of both circular arcs. This further implies, since we are considering the first intersection, it happens when both circles are tangent, as shown in Fig. 8. Therefore the center of each circle and the intersection point, denoted by , are aligned. The center of the dilating circle, of radius , is the original position of agent . The center of the translating half-circle border, of radius , of the sensing region is the new position of agent at intersection time, separated by from the original position. Therefore the time of first intersection is with .
This shows that agent , in this second case, will move by at least a distance of
[TABLE]
before possibly being stopped by the motion law we defined. However
[TABLE]
which is always true since we have agents.
We therefore conclude that for all cases, if the heading of agent falls inside , then it moves by at least
[TABLE]
before possibly being stopped by the motion law we defined.
The minimal displacement of the agent defining the sharpest corner of the convex-hull during one time-interval is bounded by the smallest value between above and the physical limit due to the travel speed , i.e. , hence we have that the bound on the step of the agent at the sharpest corner of the convex-hull is
[TABLE]
3.1.2 The Lyapunov Function
A function is called Lyapunov if it maps the state of the system to a non negative value in such a way that the system dynamics causes a monotonic decrease of this value. If the Lyapunov function reaches zero only at desirable states of the system and we prove that the dynamics leads the Lyapunov function to zero, we can argue that the system converges to a desirable state.
For the proof of system convergence, let us define variables as follows:
[TABLE]
and a global variable :
[TABLE]
so that if we have that the system is confined in a disc of radius in the plane.
Let us define the following Lyapunov function:
[TABLE]
By Lemmas 3.1 and 3.3 we have that can never increase, hence never increases. We shall next prove that with a probability which is finite and bounded away from zero by a constant, decreases by a positive and bounded away from zero quantity, until it reaches the value zero (within a finite expected time), which will then give us that will go to zero in finite expected time.
Lemma 3.9**.**
*If at time , the beginning of a time-interval, there is an agent distant more than from the agent , currently located at the sharpest corner of the convex-hull, the probability that is at least a constant bounded away from zero, is higher than . *
Note that by Lemma 3.9, we have that in finite expected time all agents will necessarily be confined inside a disc of radius .
Proof 3.10**.**
In order to evaluate the influence of agent ’s motion on the Lyapunov function defined by Eq. 11, we need to see how a step bigger than or equal to can influence in the sum defining .
Since agents and are assumed to be separated at time , then gathering has not yet been achieved at the beginning of the time-interval.
Suppose agent is located somewhere in the region shown in Fig. 9 as region . We can easily lower bound the probability that it will remain stationary during the entire initial motion of agent as follows.
If agent ’s heading falls inside the angle, as illustrated in Fig. 9, then it can sense all points in and thus will stay put throughout the initial motion of agent until stops at time . Thus agent stays put during the entire initial motion of agent with probability at least .
Now suppose that agent is located in region shown in Fig. 9. We can apply a similar reasoning by replacing by where is the first time when comes within range of agent ’s initial position . Likewise we can define an angle ensuring that if agent ’s heading falls within then agent will sense all points within which implies to stay put within , and thus come within distance of agent during ’s initial displacement. This will happen with probability at least .
With probability or larger, agent will not move, either during the entire initial movement of agent if is in region 2 or during the initial movement of agent until it comes within range of this agent if is in region 1, since, in both cases, agent will sense agent during its entire corresponding initial movement. From now on we will assume that this event occurs, i.e. that if is in region 2 then stays put during the entire initial movement of , and if is in region 1 then stays put until it is within distance from
As agent starts to move, depending on whether it is in region 1 or 2, agent does one of the following:
- (i)
stays put and becomes, due to the motion of , within range of from and then and will decrease by at least each. 2. (ii)
remains stationary at a distance bigger than from for the entire motion of , and in this case we can bound the decrease of and as follows:
Let us consider
[TABLE]
where (see Fig. 9) is the mutual distance of agents and at the beginning of a time-interval , and is their mutual distance at time when agent first stops. Using Lemma 3.1, the shrink of distance between agents and during the time-interval is at least:
[TABLE]
Let us consider as the value of a function of three variables , , :
[TABLE]
We have that
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
If agent is in region , as shown in Fig. 9, then due to Eqs. 7, 14, 15, 16, and 17, we have
[TABLE]
where , , and , thus:
[TABLE]
For finite we can show that is a constant bounded away from zero
[TABLE]
see Appendix B for full details. Note that:
[TABLE]
We have therefore proven that if agent is further than away from agent , the agent at the sharpest corner of the convex-hull, at the beginning of the time-interval, and in case of a “successful time-interval”, i.e. the heading of agent is within , then, due to Eq. 13, the Lyapunov function will decrease by at least a strictly positive constant , which is the minimum of and . By Eqs. 20 and 21, we get:
[TABLE]
We shall call such an event, described in the previous paragraph and giving a shrink in the Lyapunov function of at least , a “fully successful time-interval”, abbreviated “FSTI”. Since the headings of agent and are drawn independently, the event “ stays put during the initial motion of ” happens with probability greater than conditionally to the event “successful time-interval”, which itself happens with probability greater than (given in Eq. 6). Therefore the probability of a “FSTI”, which is the intersection of both previous events, to occur is lower-bounded by
[TABLE]
To complete the proof of Theorem 3.5, the Lyapunov function will become zero when there is a point in whose distance to all agents is smaller then . We study the expected number of time-intervals necessary for this to happen.
Recall that if is an event that occurs in a trial with probability , we have that the mathematical expectation of the number of trials for the first occurrence of in a sequence of independent trials is
[TABLE]
In our case, the probability to have a “fully successful time-interval” is greater than as given in Eq. 23 in all trials, and this event is independent from the past intervals. Thus, using Eq. 24, the expectation of the first occurrence of a “fully successful time-interval” is finite and upper bounded as follows:
[TABLE]
Once we have reached a “fully successful time-interval”, having the next “FSTI” in future intervals is independent from the first one, and we can apply the same reasoning for the second occurrence of this event. By reiterating this argument, we thus get that the expectation of the number of time-intervals for convergence inside a disc of radius is finite and is upper-bounded by
[TABLE]
since clearly the Lyapunov function equals zero is equivalent to all agents are confined in a region of radius . Since the initial value of the Lyapunov function is less than (as in the chosen Lyapunov each “edge” is counted twice), and due to Eqs. 18 and 26, the expected number of time-intervals to convergence of the system is upper-bounded as follows
[TABLE]
which is finite, and dependent on the initial constellation, the number of agents , and the radius of the blind-zone . We have, following Eq. 8, that . In particular, for sufficiently small, i.e. , that:
[TABLE]
Note how the upper-bound is approximately proportional to . This suggests that the blind-zone could be necessary in order to ensure finite expected time convergence.
3.2 Simulation Results and System Behaviour
In order to approximate continuous sensing in numerical simulations, we further divided each unit time-step into smaller time-steps . In each of these smaller time intervals, the agents’ dynamics are similar to those in the discrete case defined in Eq. 1, except that the agents keep the same heading and jump only a distance of . Every small time-steps, we have reached the end of the unit time-step and all agents randomly change their heading. In order to remain within an approximation of the continuous setting, we need to choose such that , therefore various choices are made for different sizes of blind-zone radii.
Up to scale, the dynamics of the agents are clearly invariant: given a sequence of random orientations, if the edge of the square domain size is multiplied by , and if we multiply the blind-zone radius of agents by , then the dynamics of the agents are exactly a -scaled version of the dynamics they would have been with the same randomised sequence of orientation in the unit square with blind-zone radius . We therefore choose to set the domain to a fixed size in all our simulations: agents are initially randomly and uniformly placed in a unit square.
Given this choice, our domain of interest for the size of the blind-zone is . Since for agents, then such choices of imply therefore the expected time to convergence is given by Eq. 28 in our simulations.
Furthermore, in order to avoid undesired bias due to the initial distribution of the points, agents initial positions are randomly and independently resampled for each run. Thus the initial Lyapunov value or likewise the initial maximum distance between any two agents is a random value. However, we can bound these values given our choice of fixed domain size. For instance, thanks to Pythagoras, and thus Eq. 28 becomes
[TABLE]
which we will use as a reference bound when considering several runs. Note that here we have made an abuse of notation. Indeed, in Eq. 28, the time for convergence implicitly depends on the initial configuration of the agents and so the expected value is implicitly conditioned with respect to it. To numerically approximate it, we should simulate many runs starting from the same initial distribution. However, in practice, we desire to have a quantity that would only depend on the parameters of the initial swarm (such as their number, spread, and blind-zone size), rather than a quantity biased by the initial distribution. This is why we allow ourselves to resample the distribution of the agents at each run, and the empirical average convergence time we get is actually an estimate of this quantity depending only on the initial parameters of the swarm rather than on its explicit distribution.
Typical simulation results of gathering are shown in Fig. 10. As expected, the systems converge to a disk of radius , whose centroid wanders in the plane.
We found that, for estimating the expected time of convergence, a choice of runs provides a reliable estimator. A detailed analysis of this choice is presented as supplementary material in Section 1.2.
Figure 11 summarize simulation results with different blind-zone sizes , using agents spread uniformly on the same initial area. Notice how the convergence time is proportional to , similarly to what we found in the bound Eq. 28. As expected, the average convergence time is below this theoretical bound. However, these results show how pessimistic this worst-case bound is, since it is systematically several orders of magnitude higher than the empirical convergence time. Indeed, the slope of the weighted least-squares interpolation of the results is unit time-steps per invert radius distance of the blind-zone, compared to with the same units for the theoretical bound.
Figure 12 summarize simulations with different number of agents, spread uniformly over the same initial area and with same blind-zone radius and small time-step . Empirically we find that the effect of the number of agents on the convergence time of the system is linear, similarly to the discrete case. However, in Eq. 29, the bound is , which once more reveals how pessimistic our worst case is. The slope of the weighted least-squares interpolation of the results is approximately unit time-steps per number of agents, which remarkably is not only within the same order of magnitude as the slope in the discrete version, but also smaller. This suggests that the continuous convergence scheme leads faster to convergence than the discrete rules of motion. This could be due to the fact that in the continuous version overshoot phenomena do not occur: if for instance an agent is allowed to jump but not towards the center of the cluster it only moves little by little and stops when movement negatively impacts convergence. However, simulation runtimes are significantly longer in the continuous case as we need to perform an extra loop of iterations within each unit time-interval.
4 Conclusion
We proposed and analyzed two randomized gathering processes for identical, anonymous, oblivious mobile agents, only capable to sense the presence of other agents behind their motion direction. The agents act synchronously, and at unit time-intervals they randomly select new forward motion orientations. We proved that the “continuous version” of the process ensures gathering to within a region of diameter where is a parameter setting a “blind-zone” in sensing nearby agents. Gathering happens in finite expected time, proportional to . This result also suggests that the “blind-zone” is absolutely necessary for finite expected time convergence.
The fully discrete model, in which agents perform unit jumps forward if no agents are detected behind them, was also found empirically to gather the agents to a minimal enclosing circle of radius randomly varying around , in time proportional to the number of agents. This happens in all cases we tested, however a proof of this result has not yet been found and will certainly involve probabilistic convergence arguments.
We are currently investigating the dynamics of the randomly wandering cluster of agents once gathering has been achieved. We expect to show that the centroid of the system then performs a random walk, or at least undertakes unbiased dynamics. If such is the case, we may be able to guide the swarm by extending our previous work [2] were we presented an algorithm for externally controlled steering of swarms of indistinguishable agents, in spite of the agents’ lack of information on absolute location and orientation.
APPENDICES
Appendix A Proof of Corollary 3.3
Assume with . Assume that there is such that . By continuity of thanks to the continuity of displacements, the intermediate value theorem gives us such that . In particular, we can take . By continuity of we get . If there is a value of such that , then we can once again apply the intermediate value theorem and get such that which contradicts the maximality assumption on . Therefore . We then have, using Eq. 4, that the right derivative of is negative on . For any , we have , which is a contradiction since .
Appendix B Proof of Eq. 20
Consider the case . This implies that . We thus have
[TABLE]
which is always true since trivially .
Now consider the opposite case when . Thus and then
[TABLE]
However we know that by assumption , which thus gives
[TABLE]
According to Eqs. 30 and 31 and since , we have that Eq. 32 is always true.
We have thus proved that in all cases is a strictly positive constant.
1 Analysis of the number of runs necessary for expected convergence time estimation
It is crucial to understand how reliable an estimator is when performing estimations. In this supplementary material, we analyse the choice of random independent iterations needed to estimate the expected time of convergence of the swarms. We believe an estimation is correct when the empirical average has stabilised, in a qualitative sense, to convergence as enforced by the law of large numbers.
1.1 Discrete dynamics
We denote the empirical convergence time of trial , which can be seen as the -th independent realisation of the random variable giving the convergence time , and
[TABLE]
the estimated expected convergence time using the first independent trials. We analyse as a function of and work for the final estimation with a number for which we have qualitatively reached convergence. Furthermore, we also study the evolution of the distribution of realisations of depending on the number of trials so that for our choice of number of trials this distribution has also qualitatively converged.
Analysis of the evolution of with respect to is done in Fig. S1. Empirically, the average convergence time stabilises after a few hundred rounds and has reached convergence using trials. Analysis of the evolution of the distributions of depending on the number of trials is done in Fig. S2. The normalised distributions have qualitatively reached convergence using trials. We thus decided to use simulations in order to estimate average convergence times. Furthermore, the distribution of the convergence time around the mean converges to a symmetric Gaussian-like curve, implying that we can fully summarize the empirical distributions at convergence with the empirical mean and with the traditional empirical unbiased standard deviation estimator
[TABLE]
choosing .
1.2 Continuous dynamics
Similarly to the discrete case, we denote the empirical convergence time of trial , which can be seen as the -th independent realisation of the random variable giving the convergence time , and
[TABLE]
the estimated expected convergence time using the first independent trials. We perform the same analysis of and of the distribution of the realisation as in the discrete case.
Analysis of the evolution of with respect to is done in Fig. S3. Empirically, the average convergence time stabilises after a few hundred rounds and has reached convergence using trials. Analysis of the evolution of the distributions of depending on the number of trials is done in Fig. S4. The normalised distributions have qualitatively reached convergence using trials. We thus decided to use simulations in order to estimate average convergence times. Furthermore, the distribution of the convergence time around the mean converges to a symmetric Gaussian-like curve, implying that we can fully summarize the empirical distributions at convergence with the empirical mean and with the traditional empirical unbiased standard deviation estimator
[TABLE]
choosing .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Ando, Y. Oasa, I. Suzuki, and M. Yamashita , Distributed memoryless point convergence algorithm for mobile robots with limited visibility , Robotics and Automation, IEEE Transactions on, 15 (1999), pp. 818–828, http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=795787&tag=1 .
- 2[2] A. Barel, R. Manor, and A. M. Bruckstein , On steering swarms , in International Conference on Swarm Intelligence, Springer, 2018, pp. 403–410.
- 3[3] A. Barel, R. Manor, and A. M. Bruckstein , Come together: multi-agent geometric consensus (gathering, rendezvous, clustering, aggregation) , ar Xiv preprint ar Xiv:1902.01455, (2019).
- 4[4] A. Barel, R. Manor, and A. M. Bruckstein , Probabilistic gathering of agents with simple sensors , ar Xiv preprint ar Xiv:1902.00294, (2019).
- 5[5] L. I. Bellaiche and A. M. Bruckstein , Continuous time gathering of agents with limited visibility and bearing-only sensing , tech. report, CIS Technical Report, TASP, 2015.
- 6[6] A. M. Bruckstein, N. Cohen, and A. Efrat , Ants, crickets and frogs in cyclic pursuit , Technion CIS report 9105, 1991.
- 7[7] V. Gazi and K. M. Passino , Stability analysis of swarms , IEEE Transactions on Automatic Control, 48 (2003), pp. 692–697.
- 8[8] N. Gordon, Y. Elor, and A. M. Bruckstein , Gathering multiple robotic agents with crude distance sensing capabilities , in Ant Colony Optimization and Swarm Intelligence, vol. 5217 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, 2008, pp. 72–83.
