Persistification of Robotic Tasks
Gennaro Notomista, Magnus Egerstedt

TL;DR
This paper introduces a control framework that guarantees robots can perform tasks continuously over extended periods by maintaining their energy levels, using control barrier functions and optimization to ensure persistent task execution.
Contribution
The paper presents a novel control approach that ensures persistent robotic task execution by integrating energy constraints into the control design using set invariance and barrier functions.
Findings
Framework successfully maintains robot energy levels during task execution.
Simulation and Robotarium experiments validate persistent environment exploration and surveillance.
Control method ensures tasks are performed over extended durations without energy depletion.
Abstract
In this paper we propose a control framework that enables robots to execute tasks persistently, i.e., over time horizons much longer than robots' battery life. This is achieved by ensuring that the energy stored in the batteries of the robots is never depleted. This is framed as a set invariance constraint in an optimization problem whose objective is that of minimizing the difference between the robots' control inputs and nominal control inputs corresponding to the task that is to be executed. We refer to this process as the persistification of a robotic task. Forward invariance of subsets of the state space of the robots is turned into a control input constraint by using control barrier functions. The solution of the formulated optimization problem with energy constraints ensures that the robotic task is persistent. To illustrate the operation of the proposed framework, we consider…
Click any figure to enlarge with its caption.
Figure 1
Figure 1
Figure 2
Figure 2
Figure 3
Figure 3
Figure 4
Figure 4
Figure 5
Figure 0
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 0
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Persistification of Robotic Tasks
Gennaro Notomista and Magnus Egerstedt This work was sponsored by the U.S. Office of Naval Research through Grant No. N00014-15-2115.G. Notomista (corresponding author) is with the School of Mechanical Engineering, Institute for Robotics and Intelligent Machines, Georgia Institute of Technology, Atlanta, GA 30332 USA (email: [email protected])M. Egerstedt is with the School of Electrical and Computer Engineering, Institute for Robotics and Intelligent Machines, Georgia Institute of Technology, Atlanta, GA 30332 USA (email: [email protected])
Abstract
In this paper we propose a control framework that enables robots to execute tasks persistently, i. e., over time horizons much longer than robots’ battery life. This is achieved by ensuring that the energy stored in the batteries of the robots is never depleted. This is framed as a set invariance constraint in an optimization problem whose objective is that of minimizing the difference between the robots’ control inputs and nominal control inputs corresponding to the task that is to be executed. We refer to this process as the persistification of a robotic task. Forward invariance of subsets of the state space of the robots is turned into a control input constraint by using control barrier functions. The solution of the formulated optimization problem with energy constraints ensures that the robotic task is persistent. To illustrate the operation of the proposed framework, we consider two tasks whose persistent execution is particularly relevant: environment exploration and environment surveillance. We show the persistification of these two tasks both in simulation and on a team of wheeled mobile robots on the Robotarium.
I Introduction
Robotic tasks such as environmental monitoring and exploration, as well as sensor coverage, typically evolve over long time horizons. However, robots employed to execute these tasks can only store and carry a limited amount of energy in their batteries. For this reason, we can say that such tasks are not persistent insofar as either the robots cannot complete them before their batteries deplete, or they are required to be executed repeatedly and continuously. Although robots can be designed with greater energy capacity to handle longer-duration tasks, hardware solutions will never allow robots to operate perpetually.
The objective of this paper is presenting a control framework that provably guarantees the persistent execution of robotic tasks. This is achieved by minimally modifying the nominal control inputs corresponding to the task that the robots have to execute in order to ensure the continuous execution of the task. As a result, the robots are allowed to freely execute their task whenever they have enough energy stored in their batteries, whereas they are forced to go and recharge their batteries whenever they are running out of energy.
The deployment of robots for tasks such as environmental monitoring [1, 2, 3, 4], environment exploration [5, 6, 7] and sensor coverage [8, 9, 10] has been extensively investigated. However, despite the fact that, in most cases, these tasks have to be executed over long time horizons, the limited availability of energy is not directly taken into account. Nevertheless, since low energy density is still a severe limiting factor in many mobile robotic applications, energy-awareness is a necessary feature with which robots have to be endowed [11]. This line of inquiry has been followed in [12], which considers a multi-robot, persistent coverage problem as a variant of the vehicle routing problem. A heuristic algorithm is proposed that is based on the cost-to-go-to-target, which can be adjusted online to take into account detours that pass through refueling stations present in the environment. A different approach is adopted in [13], where a formulation based on Markov decision processes is presented, that is able to ensure persistence surveillance coverage, including communication constraints and sensor failure models.
Energy-aware control policies for persistent surveillance using a team of robots are considered in [14], where an optimization problem is defined in order to trade-off between the coverage mission and the conservation of a desired energy level. This is achieved by transitioning between coverage-directed and charging-directed behaviors, based on the current energy levels. The coverage performances are improved by employing “standby” robots that can be deployed when a robot is docked at a charging station. Similarly, in [15] a solution to the problem of long-duration missions is proposed, which considers the team of robots split into “task robots”, which are in charge of executing tasks, and “delivery robots”, which are deployed with the goal of providing the task robots with the energy resources they request. Also in [16], the strategy consists in making use of a team of robots dedicated to charging tasks (ground mobile docking stations), whose trajectories are planned based on the working robots’ trajectories (UAVs), in order to guarantee randez-vous and recharge without suspending the operation of the working robots. Limited energy reserve is used as an additional constraint in [17] for a path planning strategy for the optimal deployment of multi-robot teams. A definition of persistency different from the one introduced in this paper is considered in [18], where environment persistent monitoring is solved by varying the robots’ speed along predefined trajectories with the goal of keeping a regenerating environment information field bounded, analogously to what happens to a robot cleaner in an environment in which dust is generated.
The persistification approach, through which a robotic task is rendered persistent, that we present in this paper, leverages control barrier functions (CBFs) to formulate an optimization problem where the task execution is constrained by the robots’ energy level. As formalized in [19] and [20], the constraint-driven formulation resulting from the use of CBFs is particularly amenable for long-duration robotic tasks, where goal-driven strategies, derived starting from precise model assumptions, do not guarantee enough robustness.
CBFs can be used to synthesize a controller that ensures the forward invariance of a set of the robot state space. This way, defining as the set where the battery energy level of the robots executing the task is always greater than a desired minimum value, the persistification of a task can be formally guaranteed by ensuring the forward invariance of . The first definition of CBFs was given in [21]; in this paper, we are using the one introduced in [22]. See [23] for a survey on the subject. In [24, 25, 26], variations have been introduced in order to employ CBFs with different categories of nonlinear dynamic systems for different control applications.
The persistification framework described in the following extends the work in [27]. More specifically, the main contribution of this paper is the reformulation of the task persistification introduced in [27] in order to be able to handle more complicated robot and energy dynamics. The presented method generalizes to different charging models as well as different robotic tasks. The tasks will be encoded through different nominal inputs to the robots. This allows us to formally guarantee the persistent execution of a large number and variety of robotic tasks.
The remainder of the paper is organized as follows: in Section II, the environment and robot models considered in this paper are presented, and the problem of persistification of robotic tasks is formulated. In Section III, the control framework required to ensure robotic task persistence is introduced and its application is demonstrated by means of preliminary examples. Section IV discusses the application of the presented framework to the persistification of robotic tasks. Section V is dedicated to the validation of the proposed theoretical formulation through the implementation of two robotic tasks whose persistent execution is particularly relevant, both in simulation and on a team of mobile robots.
II Problem Formulation
The goal of this paper is the persistification of robotic tasks, i. e., ensuring that the battery energy level of the robots executing the tasks never falls below a minimum value. In this section, we introduce the models used for the robots, the environment in which the robots execute their task, and the robots’ energy dynamics. We conclude the section by formalizing the persistification of robotic tasks.
II-A Robot Model
Consider a collection of robots which are to be deployed to execute a task. The state of robot , , is denoted by . We model the robots using the control affine dynamical system:
[TABLE]
where is robot ’s input, and and are two Lipschitz vector fields. Control affine dynamics arise in many robotic systems, whose models are derived using Euler-Lagrange equations (as observed in [28]), therefore they lend themselves to the description of a large variety of robotic platforms. Throughout this paper, we will assume that the robots are homogeneous, i. e., and in (1) are the same for each robot. This assumption does not compromise the proposed persistification strategy, which can be easily employed in the heterogeneous case, as will be explained in Section IV.
The robots considered in this paper are equipped with a rechargeable source of energy, e. g., a battery, and a technology required to recharge it, e. g., solar panels. In the following two subsections, we present a model for the robot energy dynamics which is coupled with the model of the environment in which the robots move. This encompasses, for instance, the situation in which solar power harvester circuits are employed to use solar panels to recharge the battery of the robots.
II-B Environment Model
The environment, i. e., the domain in which the robots are deployed to perform their task, is represented by the compact set , with or , for ground or aerial robots, respectively. The function
[TABLE]
maps the robot state to its position expressed in a Cartesian reference system defined in the environment .
Moreover, we consider the time-varying scalar field
[TABLE]
where is an interval, defined over the environment, which represents a bounded time-varying non-negative physical quantity (e. g., solar light intensity) associated to each position in at each time instant. We insist on being Lipschitz continuous in its first argument and differentiable in its second argument. The need for these assumptions will be explained in the next section.
II-C Energy Model
Let be the battery energy level of robot . The charging and discharging dynamics of the battery are modeled by:
[TABLE]
where and
[TABLE]
In (5), and are two scalars whose meaning and effect on the energy dynamics will be explained in Remark 1, and is a time-varying scalar-valued function introduced in (3).
The specific expression of has been defined in order to model the exponential charging-discharging dynamics of batteries that are used to power robotic platforms in a large number of applications [29].
In our previous work [30], the design of a solar-powered robot is presented and data of solar intensity and battery charge collected during the course of a 1-day-long experiment are reported (see Fig. 1a). Figure 1b shows the difference between the measured and its value predicted using the energy model (4). The thick green line depicts the mean of and the shaded area denotes the region of one standard deviation around the mean. Moreover, Fig. 1c shows the simulated battery charge and discharge curves that can be obtained using (4) in the cases when and , respectively: a comparison with Fig. 1a indicates that the theoretical model is able to capture the exponential charging and discharging dynamics of real batteries commonly used for robotics applications.
Remark 1**.**
* can be interpreted as a time-varying power source distributed over the environment . For instance, in the case where it represents a measure of the solar light intensity at the position and time , can be used to describe the energy dynamics of a solar rechargeable battery. With the given dynamics we have that:*
- •
, i. e., the battery is discharging, whenever
- •
, i. e., the battery is charging, when
- •
, i. e., the value of is such that the generated energy is equal to the energy required by the robot at time .
Figure 2 shows an example of what has been described: the surface plot of the function at a given time instant is depicted in grayscale (black to white for values of that go from 0 to 1). Below the surface plot, the contour plot of highlights the level curves where . Inside the regions bounded by the bold curves, characterized by , , i. e., the robots can charge their batteries.
Remark 2**.**
Lumped sources of energy, such as charging stations, can be also modeled using (3). Bump-like functions [31] at the locations of the charging stations can be employed to obtain the desired charging behavior, as depicted in Fig. 3.
Remark 3**.**
The proposed energy model does not depend on the control input of the robot. At first, this can seem inaccurate, however in Section IV we will show why this choice was made and how it increases the robustness of the proposed persistification approach, ensuring that the robots will never run out of energy while executing the given task.
II-D Task Persistification
The compound model of robot and energy dynamics is given by the following set of differential equations:
[TABLE]
Indicating the augmented state of robot by
[TABLE]
the robot model (6) can be rewritten in the following control affine form:
[TABLE]
where
[TABLE]
We will use
[TABLE]
to represent the joint states and inputs of the robots performing the task to be persistified.
Let the task that has to be executed by the robots be encoded through the nominal input
[TABLE]
or collectively as:
[TABLE]
This modeling choice for the tasks that the robots have to execute is general insofar as it encompasses both reactive feedback controllers, through the dependence of from the state , and controllers generated by a high-level planning strategy, through the dependence of from the time .
Examples of such tasks include, for instance, the stabilization of a dynamical system, the coordinated control of multi-robot systems, as well as robot collision avoidance strategies. The execution of robotic tasks using this formulation will be showcased in Section IV.
Definition 4** (Task Persistification).**
A task encoded through the nominal input , defined in (11), is persistified if the robots employed to perform it execute the input , solution of the following program:
[TABLE]
, where is the energy of robot introduced in Section II-C, and and , with , are the minimum and the maximum energy values, respectively, between which the energy of the robots should be confined.
Task persistification is referred to as the process of turning a task characterized by the nominal input to the persistified task characterized by the input .
Remark 5**.**
Definition 4 is not tailored to the specific robot and environment models, which, as a matter of fact, can be quite different from the ones presented above, depending on the particular application that is considered.
III Control Framework
As explained in Section I, the task persistification expressed as the optimization problem (12), will be realized by employing control barrier functions (CBFs). In the following subsection, we give a brief introduction to CBFs in their basic form. Then, in Section III-B, we present the extensions that represent the main theoretical contribution of this paper, namely a flexible framework to deal with time-varying and high relative degree CBFs and control Lyapunov functions (CLFs). The results obtained in this section will be employed in Section IV to formulate the optimization program whose solution achieves the persistification of robotic tasks.
III-A Control Barrier Functions
Control barrier functions have been used with the goal of ensuring safety, intended as the invariance property of a subset of the state space, the safe set. In the following we give the definition introduced in [32] which will be extended in this paper.
Definition 6** ([32]).**
Let be a continuously differentiable function, and its zero superlevel set, i. e., . Then, for a control affine system
[TABLE]
, , is a control barrier function (CBF) if there exists a locally Lipschitz extended class function [32] such that
[TABLE]
for all in the interior of the set . and represent the Lie derivative of in the directions of the vector fields and , respectively.
Starting from this basic definition, in the next sections we will develop tools required for the persistification of robotic tasks.
III-B High Relative Degree CBFs and CLFs
In the previous section, we introduced the robot and energy models, and in Remark 3, we have pointed out that the energy dynamics do not depend explicitly on the robot input . This is intended to be a conservative choice which increases the robustness of the persistification strategy. In fact, the rate of charge and discharge of the battery, obtained when and , respectively, are designed to be the rates obtained when the robot input attains its maximum value. This would correspond, for instance, to the fastest discharge rate obtained when the actuators of the robot are absorbing maximum power. Then, the actual discharge rate will always be slower than the modeled one, increasing, this way, the robustness of the proposed persistification strategy against unmodeled phenomena which can occur in the environment, or unmodeled robot dynamics. Nevertheless, the gained robustness comes at the price of increasing the relative degree of the CBF , defined below.
Definition 7** (Relative degree of a CBF, based on [25]).**
Given the nonlinear system (13), with and sufficiently smooth vector fields on a domain , the CBF has relative degree , , in if the system
[TABLE]
has a relative degree , .
In the following, we give an example in which high relative degree CBFs [25] are required. This example will be generalized in Theorem 9 for arbitrarily high relative-degree CBFs. Theorem 9 will be then applied in order to formulate an optimization program which realizes the proposed persistification strategy.
Example 8** (Cascade of CBFs).**
Let us consider the nonlinear dynamical system in control affine form (13) and a sufficiently smooth function with relative degree 2 (i. e., and ) that defines the superlevel set . To prove the forward invariance of the set , we want to be a CBF for which the following must hold:
[TABLE]
where is a continuously differentiable extended class function, and we used the fact that has relative degree 2. Then, we can define an additional function
[TABLE]
whose zero superlevel set is . If there exists a positive constant and a locally Lipschitz extended class function such that
[TABLE]
then the function is a CBF. The condition in (18) has been obtained using the definition (17) and employing . Its expression for an arbitrary high relative degree will be given in the next theorem. The existence of the CBF ensures the forward invariance of the set , which, in turn, ensures the existence of the CBF . The forward invariance of the set is thus proved.
The technique shown in example 8 is generalized in the following theorem.
Theorem 9**.**
Given a dynamical system (13), a sufficiently smooth CBF with relative degree and a CBF which can be evaluated recursively starting from using the following equation
[TABLE]
with continuously differentiable extended class functions, we define the set as
[TABLE]
where is the set of -combinations from the set and is a locally Lipschitz extended class functions. Then, any Lipschitz continuous controller will render the set forward invariant.
Proof.
Given the properties of class functions [33] and using the fact that has relative degree , is given by the following expression:
[TABLE]
Consequently, the choice of renders the set forward invariant. By recursively applying (19) times, is proved to be forward invariant. ∎
Remark 10**.**
Employing a cascade of CBFs as shown in Example 8 and in Theorem 9 is a technique which can be used not only to prove set forward invariance, but also stability of dynamical systems using high relative degree Lyapunov functions, as will be shown in the following. Set forward invariance and stability will be used, in Section IV, to ensure that the energy stored in the battery of the robots performing a task is never depleted, realizing, this way, the desired task persistification.
Note that the energy model proposed in Section II-C is time-dependent, as it depends on the environment model, through the value at at time . As we will use CBFs to ensure the persistent execution of a task, we now extend the notion of CBFs to the case in which the function that defines the safe set explicitly depends on time.
For the nonlinear control affine system (13), we wish to ensure the forward invariance of a set defined by the superlevel set of a function as:
[TABLE]
With this objective, we extend the definition of CBFs given in [32] to the time-varying case.
Definition 11** (Time-Varying CBFs).**
Given a dynamical system (13) and a set defined in (22), the function is a time-varying CBF defined on , with , if there exists a locally Lipschitz extended class function such that, ,
[TABLE]
Starting from the condition in (23), we can define the set:
[TABLE]
The following lemma ensures that the set , defined in (22), is rendered forward invariant by the application of a control input . This result will be used in Section IV to express the constraints on the energy in (12) in terms of the control input .
Lemma 12**.**
Given a set defined as in (22), if is a time-varying CBF on , then any Lipschitz continuous controller , where is given in (24), will render the set forward invariant.
Proof.
(Following the proof of Theorem 1 in [22]) If is a time-varying CBF on and , from (23) and (24) we can derive the following differential inequality:
[TABLE]
Now, consider the following boundary condition problem:
[TABLE]
whose solution is given by , being a class function (Lemma 4.4 in [34]). From (25), making use of the Comparison Lemma (Lemma 3.4 in [34]), we have that:
[TABLE]
Hence, if and therefore , using the properties of class functions, (27) ensures that and so . Thus, is forward invariant. ∎
Remark 13**.**
In case and we are not able to ensure the existence of a control input such that (23) holds, condition on which Lemma 12 relies. This case can be tackled by making use of a cascade of control barrier functions and the result of Theorem 9.
So far, we have described the condition in which the robots are executing the assigned task and we want them to keep their energy level within a certain interval . This is realized, employing CBFs, by letting the robots reach regions of the state space where the field introduced in (3) is larger than . In these regions, as observed in Remark 1, and the robots are charging.
The use of CBFs will allow the robots to keep the energy stored in their batteries within the desired interval by synthesizing a controller solution of an optimization problem at each time instant. Nevertheless, although computationally efficient, this point-wise in time approach prevents the robots from planning their charging strategy. Therefore, in order to make sure that the robots leave the charging stations—intended, in a broader sense, as the regions of the state space where —only when their battery is fully charged, i. e., when , we will make use of control Lyapunov functions (CLFs). Similarly to what happens with CBFs encoding energy constraints, since the input does not directly show up in the expression of , a CLF defined to fully recharge the robots’ battery will have relative degree higher than 1. Therefore, in the following, we will proceed analogously to what has been done for high relative degree CBFs to handle the case of high relative degree CLFs.
Suppose that, besides the forward invariance of a set, one would like to stabilize the dynamical system (13) around the origin using a CLF. Similarly to what has been done for the CBFs, the existence of a CLF suggests the definition of the following set:
[TABLE]
It is easy to see how the choice of a control input will stabilize the system around .
Remark 14**.**
In case , i. e., the relative degree of the Lyapunov function is greater than 1, we cannot ensure that the origin is stable.
In Example 8 and in Theorem 9, a technique for dealing with high relative degree control barrier functions has been introduced: we will proceed here in a similar fashion. We first give an example that shows how to construct high-relative degree CLFs by employing CBFs. Then, we generalize this construction in Theorem 16.
Example 15** (Constructing high relative degree CLFs using CBFs).**
Let us consider the nonlinear dynamical system in control affine form (13) and a sufficiently smooth function with relative degree 2, i. e., and . In order for to be a CLF we must have . We can then define the CBF , and its superlevel set , and let . where is a locally Lipschitz class function. This way, the set can be rendered forward invariant. Consequently, the existence of the CLF guarantees that the origin is (asymptotically) stable.
Theorem 16**.**
Consider the dynamical system (13), a CLF with relative degree defined with the objective of stabilizing the system state to , and a CBF which can be evaluated from using the following recursive formula:
[TABLE]
where are continuously differentiable extended class functions. In addition, assume that is the largest invariant set in , boundary of the set . Then, any Lipschitz continuous controller
[TABLE]
where is a locally Lipschitz class function, will asymptotically stabilize the system to .
Proof.
Given the properties of class functions and using the fact that has relative degree , the following expression of can be derived:
[TABLE]
The choice of will render the set forward invariant. Similarly to what has been done in Theorem 9, by the recursive application of (29), is proven to be forward invariant. Then, by LaSalle’s Theorem (Theorem 4.4 in [34]), as is the largest invariant set in , one has that as . ∎
IV Application to Robotic Tasks
In view of what has been introduced in the previous section, in this section we show that the persistification of robotic tasks, specified in Definition 4, can be framed as a constrained optimization problem.
In order for the robots to be able to perpetually execute the given task, their energy must be strictly greater than zero at each point in time. Moreover, in order to extend the battery life of the specific types of batteries used in robotic applications, the lower bound for the residual energy should not be too low in order to protect the battery from deep discharge [35]. Considering the robot model (6), the constraints to control the residual energy can be formally encoded by the following CBF related to robot :
[TABLE]
where and are the upper and lower bounds between which we want the energy to be confined, corresponding to charged and depleted battery, respectively.
Following the procedure adopted in Example 8, we start by evaluating the time derivative of :
[TABLE]
which does not depend on as the relative degree of is 2. Therefore, we define the CBF as done in (17), namely:
[TABLE]
in which the locally Lipschitz class function has been chosen to be a linear function , with .
Using Theorem 9 and Lemma 12, we can define the set of control inputs that will render the set forward invariant:
[TABLE]
in which , with , is the locally Lipschitz extended class function in (18). The expressions of , and are reported in full in Appendix A.
Note that, due to the control affine form of (7), is an affine constraint in and therefore it can be written as:
[TABLE]
with and .
Remark 17**.**
Recalling the expression of introduced in (4), the behavior resulting by enforcing the constraint (36) is the following: when the battery level is getting close to its minimum value , robot will drive towards areas of the environment where the value of the function is such that , i. e., robot starts recharging its battery.
During operation, the battery of each robot continuously discharges according to the dynamics in (4). The mere application of the constraint (36) prevents the battery level to go lower than or higher than . However, it does not ensure that the battery will be completely charged before robot leaves areas of the environment where . This behavior is desirable for two reasons: (i) this kind of charging/discharging cycles will extend the life of robot batteries [29, 35], and (ii) it is more efficient from the point of view of the time spent by the robots recharging their batteries allowing them to deviate from the nominal task input for less time. A quantitative justification of the second reason is given in Section V-A.
This behavior can be encoded using a CLF whose objective is that of driving the energy to . We can then define the following CLF:
[TABLE]
related to robot , whose time derivative is given by
[TABLE]
As in the case of in the previous section, here has relative degree 2 and therefore its time derivative is not a function of the control input . Proceeding as before, we define the CBF . Its superlevel set is the set in which the value of the function is not increasing. Its boundary is the set where which, if , coincides with the -dimensional manifold . Therefore, by Theorem 16, if
[TABLE]
with , the value of will asymptotically converge to . The expressions of , and for the energy model (4)-(5) are reported in full in Appendix A. Note that is an affine constraint in , and therefore it can be written as:
[TABLE]
with and .
In order to combine the CBF constraints (36) and the CLF constraints (40), the following nonlinear program can be formulated:
[TABLE]
where
[TABLE]
, and and are identity and zero matrices, respectively. Moreover, is a vector of relaxation parameters introduced to make the constraints in (41) always feasible. The matrix is a diagonal, positive definite, weighting matrix for . Furthermore, the nominal input is what encodes the task introduced in Definition 4.
Both (36) and (40) are affine functions of the optimization variables. Moreover, the cost is a convex quadratic form. Hence, (41) is a convex quadratic program (QP) and, as such, can be efficiently solved—see, e. g., [36]—and employed to generate controllers in an online fashion as shown, for instance, in [37].
Remark 18**.**
As discussed in Section II, the proposed persistification approach also works when the robots are not homogeneous, i. e., when they are characterized by different dynamic models. In fact, in this case, once the constraints of the optimization program (41) have been changed accordingly, the solution guarantees the heterogeneous multi-robot system persistently executes the task characterized by the nominal input .
Remark 19**.**
In order to achieve the desired charging behavior, discussed in Section III, can be made a function of in such a way that robot charges up to once it started charging and, at the same time, discharge down to while operating. A candidate mapping is depicted in Fig. 4. Note that the function changes according to the sign of . This way, the weight of the corresponding relaxation parameter changes when the energy reaches or .
We conclude this section stating the proposition which ensures the task persistification as defined in Definition 12.
Proposition 20**.**
The control law , solution of the QP (41), makes the robots execute the persistified task corresponding to the task encoded through the control input .
Proof.
Follows from the application of Theorem 9, Lemma 12 and Theorem 16. ∎
In the next section, the developed theoretical control framework will be validated by means of simulations and experiments. Two robotic tasks are introduced and the results of the their persistent implementation are reported.
V Simulations and Experiments
In this section, two robotic tasks, whose persistent application is particularly relevant, are presented to showcase the persistification strategy presented in this paper. The tasks consist in environment exploration and environment surveillance. Both tasks are typically required to be executed for a long period of time. In the case of environment exploration, the long execution time can be due to the size and/or the dynamic nature of the environment to explore [38, 39]. As regards the environment surveillance, the time-scale of the observed environment phenomena is the factor that determines the length of the task. Nowadays, longevity is still a limiting factor for the deployment of robotic systems for environment surveillance and monitoring, as described in [40]. The persistification of these two tasks through the control framework described in the previous section is discussed in Section V-A and Section V-B, respectively.
V-A Environment Exploration
The first application that is considered is that of environment exploration. There are many approaches to this task in literature, as discussed in Section I, and many solutions have been proposed. Here we consider the one presented in [41], since the result of a trajectory optimization problem provides directly the nominal inputs to the robots. The optimal trajectories are evaluated by minimizing the distance from ergodicity [42]. This results in a trajectory that, instead of maximizing information in a greedy way, distributes information according to its probability density function defined over the environment.
Following what is presented in [41], let us start by defining the following ergodic metric as in [42]:
[TABLE]
where are the time-averaged Fourier coefficients of the trajectory, are the Fourier coefficients of a spatial distribution of information , being the environment to explore, and
[TABLE]
with , being the number of employed Fourier basis functions.
By minimizing this ergodic measure at time over a time horizon , the nominal trajectory for of each robot is obtained. Solving this optimization problem at every time instant, in a model predictive control fashion, provides the nominal input that is to be executed at time in order to track the trajectory [41]. This can be plugged in the QP defined in (41) allowing, this way, a straightforward application of the persistification framework presented in this paper to the environment exploration task.
The persistent environment exploration task has been implemented and tested in a simulation environment. For the simulated experiment a planar robot is given the task of exploring an environment on which a spatial distribution of information has been defined. The information is assumed to be distributed according to the Gaussian density function
[TABLE]
where, for the experiments, the following values are used: and . The time-varying environment field is modeled as a mixture of time-varying Gaussians of the following form:
[TABLE]
where and
[TABLE]
In the case of robots that are able to exploit solar power to recharge their batteries, this choice of the function simulates the sunlight intensity that is characterized by a periodic expression over a spatially fixed environment. The other values required to model (4) in (6) are set to and .
The optimization problem aimed at minimizing the ergodic cost (43) is solved offline and the resulting trajectory is given to the robot as a reference for the tracking controller. The input required to track the trajectory is wrapped by the QP (41) in such a way that the robot explores the environment while satisfying at the same time the energy constraint that prevents its battery level to go below the lower threshold . The result of this controller is a persistified environment exploration.
Figures 5a to 5e show a sequence of snapshots taken during the course of the environment exploration experiment described above. The contour plot of the function is depicted as green thin solid lines, while the contour plot of the environment field is represented by the yellow thin dashed lines. The position that the robot is tracking under the nominal control input is depicted as a black square, whereas its actual position executing the controller (41) is represented by a black circle. Furthermore, nominal and executed trajectories are represented by a gray thick solid line and a red thick dashed line, respectively.
Figure 6 compares the probability density function representing the spatial information distribution (Fig. 6a) with the probability density functions for the time-averaged optimized trajectory with energy constraints (Fig. 6b) and without energy constraints (Fig. 6c). Even though the latter more closely matches the spatial distribution , it does not take into account that the robot has a finite availability of energy.
In Fig. 7, the energy level of the robot during a persistified long-term exploration experiment are reported. The application of the control framework presented in this paper is demonstrated to be successful in persistifying the robotic exploration. This is realized by keeping the robot energy level constrained above a minimum value in an optimal way by means of the QP (41). This way, the robot is completely free of tracking the ergodic trajectory given as input to its motion controller, as long as its battery level is above the lower threshold and below the upper threshold , depicted as thick solid lines in Fig. 7.
In Section IV, CLF constraints were introduced with the objective of completely recharging the battery of the robots before leaving areas of the environment where , referred to as charging stations. This leads to a more efficient task execution in the sense that the robots overall spend less time recharging their batteries. This condition allows them to deviate from the nominal task input of a smaller amount over the course of the experiment. In Fig. 8, the value of at time represents the integral over the time interval of the difference between the nominal control input —corresponding to the exploration task—and the input executed by the robot:
[TABLE]
The green curve is generated letting the robot execute the input , solution of (41). The red curve corresponds to the situation where the robot executes the input solution of (41) in which the CLF constraint (40) for battery recharging has been removed. As can be seen, the battery recharging constraint allows the robot to visit charging stations less often, so that its input deviates from the nominal one of a smaller amount over the course of the task execution.
V-B Environment Surveillance
The second application that is considered as showcase is environment surveillance. The employment of mobile sensors improves coverage and data gathering performances compared to static sensors, whose positions are determined based on offline optimization algorithms. In this sense, mobility can allow a more efficient estimation of time-varying information fields. However, this comes at the price of higher power consumption. Most of the approaches developed so far assume that the mobile sensors are able to move for an unlimited amount of time. The control framework presented in this paper can be used to persistify such surveillance tasks.
The task of environment surveillance can be framed as a sensor coverage control problem, that is an instance of the broader optimal sensor placement problem whose applications can be found in many other disciplines, such as [43]. As in Section V-A, let the map represent a spatial distribution density function. This can be interpreted as a measure of the information spread over the environment or the probability that an event can take place at a location . Moreover, let us define the locational optimization function as in [8]:
[TABLE]
where are the positions of the robots present in the environment , is a partition of , is a non-decreasing differentiable function describing the degradation in the sensing performances of the robots. Proceeding as in [8], we aim at minimizing with respect to both and . The minimization with respect to the environment partition leads to [43], being the Voronoi partition of defined by the Voronoi cells
[TABLE]
As far as the minimization with respect to the robot locations is concerned, considering the single integrator dynamics of the robots, and setting allows us to define the following gradient descent update step to be used as a control law for moving the robots [44]:
[TABLE]
In (51), is a proportional gain and is the centroid of the -th Voronoi cell defined as:
[TABLE]
In case the map is time-varying, the extension presented in [9] and [45] can be employed. The coverage task with and given by (51), can be persistified by the implementation of the optimization-based controller solution of the QP in (41).
The persistified environment surveillance has been implemented and deployed on the Robotarium, a remotely accessible swarm robotics research testbed [46]. A team of 7 differential-drive robots attempts to cover the m testbed area by making use of the coverage control introduced above. The Robotarium is endowed with wireless charging stations, allowing the modeling of the charging field given in (3) by means of bump-like functions as depicted in Fig. 3. The robot model is a single integrator model, where we can directly control the robots’ velocities utilizing the proportional control law given in (51). This input is wrapped by the QP (41) in order to satisfy the energy constraints. This results in a persistified environment surveillance.
Figures 9a to 9d show the salient frames of the persistent environment surveillance experiment performed on the Robotarium. The Voronoi cells are superimposed on the frames. The yellow and blue wireless charging station are arranged along one of the edges of the testbed. Following the nominal controller (51), the robots perform sensor coverage (9a). The actual controller executed by the robots is the solution of (41), which allows them to go back and recharge their batteries to prevent the stored energy from going below the minimum desired value as shown in 9b, 9c and 9d (the charging stations occupied by robots are marked in red).
In Fig. 10, the thin line shows the value of the locational cost defined in (49) evaluated during the course of the experiment. Here the information distribution density function has been set to a constant value, meaning that the information is equally spread over the testbed. The thick line of Fig. 10 represents the value of the cost (49) obtained if no energy constraints were imposed, in which case, the robots are able to asymptotically reach a centroidal Voronoi configuration, where each robot is in the centroid of its corresponding Voronoi cell. This configuration leads to a local minimum of (49). The value of decreases in correspondence of the situations in which all the available robots are not constrained to charge and are consequently free of following the coverage control input.
Figure 11 displays the actual energy stored in the batteries of the robots as measured during the course of the experiment. The inputs to the robots are constrained by (36) and (40) in such a way that their energies never exceed , while never going below either. As a results of the optimization-based formulation, each individual robot is able to follow as closely as possible the input encoding the coverage task as long as its battery level satisfies the imposed constraints.
VI Conclusions
In this paper we introduced the concept of robotic task persistification, i. e., the process of rendering a robotic task persistent. This allows robots to execute a task over long time horizons, by ensuring that the energy stored in their batteries never gets depleted. The control of the robot energy level is wrapped around the task controller by utilizing control barrier functions and control Lyapunov functions combined in a single optimization-based controller which can be efficiently executed in an online fashion. The result of this formulation is a control framework that allows the robots to execute as closely as possible the assigned task, while simultaneously never depleting the energy in their batteries. This constraint is enforced by expressing the task persistence condition in terms of the forward invariance of a subset of the state space of the robots. The forward invariance property is then ensured by employing control barrier functions, whereas battery recharging is achieved by defining a suitable control Lyapunov function.
The results in Theorem 9, Lemma 12 and Theorem 16 allow us to apply the presented framework to many different kinds of robots endowed with rechargeable, or even interchangeable, sources of energy. In order to be able to efficiently enforce energy constraints, we insist on the robot dynamic model being in control affine form, case that is often encountered in robotic applications. Since the persistified task is obtained as the output of an optimization problem, the robots are free to execute the given task as closely as possible as long as their source of energy is not discharging below a given lower threshold. The persistification strategy has been applied to environment exploration and monitoring tasks, and it has been tested both in simulation and on an team of ground mobile robots.
Appendix A
For single integrator robot dynamics and the CBFs (32) and (34), the expressions of , and required to evaluate (36) are given by:
[TABLE]
For single integrator robot dynamics and the CLF (37), the expressions of , and required to evaluate (40) are given by:
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Ollero, S. Lacroix, L. Merino, J. Gancet, J. Wiklund, V. Remuß, I. V. Perez, L. G. Gutiérrez, D. X. Viegas, M. A. G. Benitez et al. , “Multiple eyes in the skies: architecture and perception issues in the comets unmanned air vehicles project,” Robotics & Automation Magazine , vol. 12, no. 2, pp. 46–57, 2005.
- 2[2] E. Fiorelli, N. E. Leonard, P. Bhatta, D. A. Paley, R. Bachmayer, and D. M. Fratantoni, “Multi-AUV control and adaptive sampling in Monterey bay,” Journal of Oceanic Engineering , vol. 31, no. 4, pp. 935–948, 2006.
- 3[3] N. E. Leonard, D. A. Paley, R. E. Davis, D. M. Fratantoni, F. Lekien, and F. Zhang, “Coordinated control of an underwater glider fleet in an adaptive ocean sampling field experiment in Monterey bay,” Journal of Field Robotics , vol. 27, no. 6, pp. 718–740, 2010.
- 4[4] R. Graham and J. Cortés, “Adaptive information collection by robotic sensor networks for spatial estimation,” Transactions on Automatic Control , vol. 57, no. 6, pp. 1404–1419, 2012.
- 5[5] B. Kuipers and Y.-T. Byun, “A robot exploration and mapping strategy based on a semantic hierarchy of spatial representations,” Robotics and Autonomous Systems , vol. 8, no. 1, pp. 47–63, 1991.
- 6[6] R. O’Flaherty and M. Egerstedt, “Optimal exploration in unknown environments,” in International Conference on Intelligent Robots and Systems (IROS) . IEEE, 2015, pp. 5796–5801.
- 7[7] W. Burgard, M. Moors, D. Fox, R. Simmons, and S. Thrun, “Collaborative multi-robot exploration,” in International Conference on Robotics and Automation , vol. 1. IEEE, 2000, pp. 476–481.
- 8[8] J. Cortes, S. Martinez, T. Karatas, and F. Bullo, “Coverage control for mobile sensing networks,” Transactions on Robotics and Automation , vol. 20, no. 2, pp. 243–255, 2004.
