dRRT*: Scalable and Informed Asymptotically-Optimal Multi-Robot Motion Planning
Rahul Shome, Kiril Solovey, Andrew Dobson, Dan Halperin, Kostas E., Bekris

TL;DR
This paper introduces dRRT*, a scalable, informed, asymptotically-optimal multi-robot motion planner that efficiently finds high-quality paths in complex multi-robot scenarios, outperforming previous methods in scalability and convergence speed.
Contribution
It extends the prior dRRT algorithm to achieve theoretical optimality guarantees and incorporates heuristics for improved search efficiency in multi-robot configuration spaces.
Findings
dRRT* converges quickly to high-quality paths
The algorithm scales to more robots than previous methods
It successfully solves real-world multi-robot arm problems
Abstract
Many exciting robotic applications require multiple robots with many degrees of freedom, such as manipulators, to coordinate their motion in a shared workspace. Discovering high-quality paths in such scenarios can be achieved, in principle, by exploring the composite space of all robots. Sampling-based planners do so by building a roadmap or a tree data structure in the corresponding configuration space and can achieve asymptotic optimality. The hardness of motion planning, however, renders the explicit construction of such structures in the composite space of multiple robots impractical. This work proposes a scalable solution for such coupled multi-robot problems, which provides desirable path-quality guarantees and is also computationally efficient. In particular, the proposed \drrtstar\ is an informed, asymptotically-optimal extension of a prior sampling-based multi-robot motionā¦
| Number of nodes: = | 50 | 100 | 200 |
|---|---|---|---|
| -PRM* construction | 3.427 | 13.293 | 69.551 |
| -PRM* query | 0.002 | 0.005 | 0.019 |
| 2 -size PRM* construction | 0.135 | 0.274 | 0.558 |
| Implicit A* search over | 0.886 | 4.214 | 15.468 |
| Ā over (initial) | 1.309 | 0.999 | 0.638 |
| Ā over (initial) | 0.003 | 0.002 | 0.002 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
ā
11institutetext: Rahul Shome, and Kostas Bekris 22institutetext: Computer Science Dept. of Rutgers Univ., NJ, USA
22email: {rahul.shome, kostas.bekris}@cs.rutgers.com 33institutetext: Kiril Solovey, and Dan Halperin 44institutetext: Computer Science Dept. of Tel Aviv Univ., Israel
44email: {kirilsol, danha}@post.tau.ac.il 55institutetext: Andrew Dobson 66institutetext: Electrical Engineering and Computer Science Dept. of Univ. of Michigan, MI, USA
66email: [email protected]
dRRT*: Scalable and Informed Asymptotically-Optimal
Multi-Robot Motion Planning
Rahul Shome
āā
Kiril Solovey
āā
Andrew Dobson
āā
Dan Halperin
āā
Kostas E. Bekris A. Dobson, R. Shome and K. Bekris were supported by NSF IIS 1617744 and CCF 1330789.K. Solovey and D. Halperinās work has been supported in part by the Israel Science Foundation (grant no.Ā 825/15) and by the Blavatnik Computer Science Research Fund. Kiril Solovey has also been supported by the Clore Israel Foundation.
Abstract
Many exciting robotic applications require multiple robots with many degrees of freedom, such as manipulators, to coordinate their motion in a shared workspace. Discovering high-quality paths in such scenarios can be achieved, in principle, by exploring the composite space of all robots. Sampling-based planners do so by building a roadmap or a tree data structure in the corresponding configuration space and can achieve asymptotic optimality. The hardness of motion planning, however, renders the explicit construction of such structures in the composite space of multiple robots impractical. This work proposes a scalable solution for such coupled multi-robot problems, which provides desirable path-quality guarantees and is also computationally efficient. In particular, the proposed Ā is an informed, asymptotically-optimal extension of a prior sampling-based multi-robot motion planner, . The prior approach introduced the idea of building roadmaps for each robot and implicitly searching the tensor product of these structures in the composite space. This work identifies the conditions for convergence to optimal paths in multi-robot problems, which the prior method was not achieving. Building on this analysis, Ā is first properly adapted so as to achieve the theoretical guarantees and then further extended so as to make use of effective heuristics when searching the composite space of all robots. The case where the various robots share some degrees of freedom is also studied. Evaluation in simulation indicates that the new algorithm, ā converges to high-quality paths quickly and scales to a higher number of robots where various alternatives fail. This work also demonstrates the plannerās capability to solve problems involving multiple real-world robotic arms.
ā ā journal: AURO
1 Introduction
A variety of robotic applications, ranging from manufacturing to logistics and service robotics, involve multiple robotic systems operating in the same workspace. In traditional, industrial domains, such as car manufacturing, the environment is fully known and predictable. This allows the robots to operate in a highly scripted manner by repeating the same predefined motions as fast as possible. New types of tasks, however, require robotic manipulators that compute high-quality paths on the fly. For instance, a team of robotic arms can be tasked to pick and sort a variety of objects that are dynamically placed on a common surface. Multiple challenges need to be addressed in the context of such applications, such as detecting the configuration of the objects and grasping. This work deals with the multi-robot motion planning (MMP) problem (Wagner and Choset, 2013; Gravot and Alami, 2003a; Gharbi etĀ al, 2009) in the context of such setups, i.e., computing the paths of multiple, high-dimensional systems, such as robotic arms, that operate in a shared workspace, as shown in FigureĀ 1. The focus is to solve MMP in a computationally efficient way as well as in a coupled manner, which allows to argue about the quality of the resulting paths.
Planning for multiple, high-dimensional robotic systems is quite challenging. The motion planning problem is already computationally hard for a single robot (Canny, 1988) that is a kinematic chain of rigid bodies. Thus, most approaches for multi-robot motion planning either quickly become intractable as the number of robots increases or alternatively sacrifice completeness and path quality guarantees. In particular, problem instances are especially hard when the robots operate in a shared workspace and in close proximity. In this case, it is not easy to reason for the robots in a decoupled manner. Instead, it is necessary to operate in the composite configuration space of all robots. The space requirements, however, for solving motion planning instances increase exponentially with problem dimensionality. The composite space of all robots in MPP instances is typically very high-dimensional to explore in a comprehensive and resolution complete manner, such as discretizing it with a grid and searching over it.
Sampling-based planners aim to help with such dimensionality issues by approximating the connectivity of the underlying configuration space. They construct graph-based representations, such as a roadmap or a tree data structure, which store collision free configurations and paths through a sampling process. Under certain conditions regarding the density of the corresponding graph, sampling-based planners can provide desirable path quality guarantees. Specifically, they achieve asymptotic optimality, i.e., as the sampling process progresses, the best path on the graph converges to the optimal one in the original configuration space. Nevertheless, even sampling-based planners face significant space challenges in the context of MPP problem, such as the one shown in FigureĀ 1, which corresponds to a 28-dimensional space. In particular, it becomes infeasible with standard, asymptotically optimal sampling-based planners to explicitly store a graph in the corresponding space that will allow the discovery of a solution in practice. This is due to the large number of samples required to cover an exponentially larger volume as the dimensionality of the underlying space increases. Asymptotically optimal planners must maintain in the order of edges per sample, where is the number of samples. Thus, when planning for high-dimensional systems, the space requirements of the corresponding roadmaps surpass the capabilities of standard workstations rather quickly.
A previously proposed sampling-based planner specifically designed for multi-robot problems, called Ā (Solovey etĀ al, 2015a), achieved progress in this area by leveraging an implicit representation of the composite space in order to provide both completeness and efficiency. This implicit representation is a graph, which corresponds to the tensor product of roadmaps explicitly constructed for each robot. This allows finding solutions for relatively high-dimensional multi-robot motion planning problems. Nevertheless, this prior method did not provide any path quality guarantees.
One key contribution of this work is to show that the structure of this implicit representation is guaranteed (asymptotically) to contain the optimal path for a set of robots moving simultaneously. Nevertheless, defining an implicit graph that contains a high-quality solution does not guarantee that the final solution is optimal unless the search process over this graph is appropriate. While a provably optimal search approach, such as , could be implemented to search this graph, the extremely large branching factor of the implicit roadmap makes this prohibitively expensive, especially in the context of anytime planning. Instead, this work leverages the observation that a sampling-based method inspired by RRTā, which maintains a spanning tree over the underlying implicit graph, will return optimal solutions if it allows rewiring operations during the spanning tree construction. Namely, it must converge to the tree with all of the minimum-cost paths starting from the initial query state to each other node in the graph. Further, this work shows that for a broad range of cost functions over paths in this graph can be used while still guaranteeing the proposed Ā approach will asymptotically converge towards such a tree.
This paper is an extension of prior work (Dobson etĀ al, 2017), which introduced an initial version of the and the sufficient conditions for generating an asymptotically optimal planner in this context. The current manuscript provides the following extensions:
- ā¢
A more thorough analysis that shows that the desirable guarantee can be achieved for an additional distance metric for multi-robot motion planning;
- ā¢
A more detailed description of the method, which has been further improved for computational efficiency purposes through the appropriate incorporation of heuristics;
- ā¢
The method has been extended to handle systems with shared degrees of freedom, as shown in related work (Shome and Bekris, 2017).
- ā¢
The experimental section has been extended to include the new methods as well as demonstrations on physical platforms.
The following section summarizes related prior work on the subject before SectionĀ 3 introduces the problem setup. SectionĀ 4 describes the underlying structure of the implicit tensor-roadmap and the previous method Ā (Solovey etĀ al, 2015a). The changes to necessary to achieve asymptotic optimality and computational efficiency, which result to the proposed algorithm are presented in SectionĀ 5. An analysis of the properties of the method are showcased in SectionĀ 6. The method is extended, in SectionĀ 7 to systems with shared degrees of freedom. SectionĀ 8 evaluates the methods experimentally and demonstrates their performance.
2 Prior Work
The multi-robot motion planning problem (MMP) is notoriously difficult as it involves many degrees of freedom, and consequently a vast search space, as each additional robot introduces several additional degrees of freedom to the problem. Certain instances of the problem can be solved efficiently, i.e., in polynomial run time, and in a complete manner, at times even with optimality guarantees on the solution costsĀ (Turpin etĀ al, 2013; Adler etĀ al, 2015; Solovey etĀ al, 2015b). However, in general MMP is computationally intractableĀ (Hopcroft etĀ al, 1984; Spirakis and Yap, 1984; Solovey and Halperin, 2016; Johnson, 2018).
Decoupled MMP techniques (Erdmann and Lozano-Perez, 1987; Ghrist etĀ al, 2005; LaValle and Hutchinson, 1998; Peng and Akella, 2004; Van Den Berg and Overmars, 2005; Van DenĀ Berg etĀ al, 2009) reduce search space size by partitioning the problem into several subproblems, which are solved separately. Then, the different solutions are combined. These methods, however, typically lack completeness and optimality guarantees. While some hybrid approaches can take advantage of the inherent decoupling between robots and provide guarantees (Van DenĀ Berg etĀ al, 2009), they are often limited to discrete domains. The problem is more complex when the robots exhibit non-trivial dynamics (Peng and Akella, 2005). Collision avoidance or control methods can scale to many robots, but lack path quality guarantees (Van DenĀ Berg etĀ al, 2011; Tang and Kumar, 2015).
In contrast to that, centralized approaches (Kloder and Hutchinson, 2005; OāDonnell and Lozano-PĆ©rez, 1989; Salzman etĀ al, 2015; Solovey etĀ al, 2015a; Svestka and Overmars, 1998; Wagner and Choset, 2013) usually work in the combined high-dimensional configuration space, and thus tend to be slower than decoupled techniques. However, centralized algorithms often come with stronger theoretical guarantees, such as completeness. Through the rest of this section we will consider centralized methods, with an emphasis on sampling-based approaches.
Sampling-based algorithms for a single robot (Kavraki etĀ al, 1996; LaValle and Kuffner, 1999; Karaman and Frazzoli, 2011) can be extended to the multi-robot case by considering the fleet of robots as one composite robot (Gildardo, 2002). Such an approach suffers from inefficiency as it overlooks aspects of multi-robot planning, and hence can handle only a very small number of robots. Several techniques tailored for instances of MMP involving a small number of robots have been describedĀ (Hirsch and Halperin, 2004; Salzman etĀ al, 2015).
In previous workĀ (Solovey and Halperin, 2014), an extension of MMP was introduced, which consists of several groups of interchangeable robots. At the heart of the algorithm is a novel technique where the problem is reduced to several discrete pebble-motion problems (Kornhauser etĀ al, 1984; Luna and Bekris, 2011; Yu and LaValle, 2013). These reductions amplify basic samples into massive collections of free placements and paths for the robots. An improved versionĀ (Krontiris etĀ al, 2015) of this algorithm applied it to rearrange multiple objects using a robotic manipulator.
Previous workĀ (Svestka and Overmars, 1998) introduced a different approach, which leverages the following fundamental observation: the structure of the overall high-dimensional multi-robot configuration space can be inferred by first considering independently the free space of every robot, and combining these subspaces in a meaningful manner to account for robot-robot collisions. They suggested an approach which combines roadmaps constructed for individual robots into one tensor-product roadmap , which captures the structure of the joint configuration space (see more information in SectionĀ 4).
Due to the exponential nature of the resulting roadmap, this technique is only applicable to problems that involve a modest number of robots. A recent workĀ (Wagner and Choset, 2013) suggests that does not necessarily have to be explicitly represented. They apply their Ā algorithm to efficiently retrieve paths over , while minimizing the explored portion of the roadmap. The resulting technique is able to cope with a large number of robots, for certain types of scenarios. However, when the degree of simultaneous coordination between the robots increases, there is a sharp increase in the running time of this algorithm, as it has to consider many neighbors of a visited vertex of . This makes less effective when the motion of multiple robots needs to be tightly coordinated.
Recently a different sampling-based framework for MMP was introduced, which combines an implicit representation of with a novel approach for pathfinding in geometrically-embedded graphs tailored for MMPĀ (Solovey etĀ al, 2015a) . The discrete-RRT () algorithm is an adaptation of the celebrated Ā algorithm for the discrete case of a graph, and it enables a rapid exploration of the high-dimensional configuration space by carefully walking through an implicit representation of the tensor product of roadmaps for the individual robots (see extensive description in SectionĀ 4). The approach was demonstrated experimentally on scenarios that involve as many as DoFs and on scenarios that require tight coordination between robots. On most of these scenarios Ā was faster by a factor of at least ten when compared to existing algorithms,including the aforementioned .
Later, Ā was applied to motion planning of a free-flying multi-link robotĀ (Salzman etĀ al, 2016). In that case, Ā allowed to efficiently decouple between costly self-collision checks, which were done offline, and robot-obstacle collision checks, by traversing an implicitly-defined roadmap, whose structure resembles to that of . Ā has also been used in the study of the effectiveness of metrics for MMP, which are an essential ingredient in sampling-based plannersĀ (Atias etĀ al, 2017).
The current work proposes Ā and shows that it is an efficient asymptotically optimal extension of the previously proposed . The Ā framework is an anytime algorithm, which quickly finds initial solutions and then refines them, while ensuring asymptotic convergence to optimal solutions. Simulations show that the method practically generates high-quality paths while scaling to complex, high-dimensional problems, where alternatives fail.
3 Problem Setup and Notation
We start with a definition of the problem. Consider a shared workspace with holonomic robots, each operating in a -dimensional configuration space for . For a given robot , denote its free space, i.e., the set of all collision free configurations, by , and the obstacle space by .
The composite configuration space is the Cartesian product of each robotās configuration space. That is, a composite configuration is an -tuple of robot configurations. For two distinct robotsĀ , denote by the set of configurations of robot , which lead into collision with robot at its configuration . Then, the composite free space consists of configurations in which robots do not collide with obstacles or pairwise with each other. Formally:
for every ;
for every .
The composite obstacle space is defined as .
Multi-robot planning is concerned with finding (collision-free) composite trajectories of the form . is an -tuple of single-robot trajectories .
This work is concerned with producing high-quality trajectories, which minimize certain cost functions. In particular, we consider three cost functions , which are presented below. Let be a composite trajectory. For the following, denotes the standard arc length of a curve:
The sum of path lengths: .
The maximum path length: .
The Euclidean arc length of :
This work presents Ā as an efficient, anytime solution to the robustly-feasible composite motion planning (RFCMP) problem :
Definition 1 (RFCMP)
Given robots operating in composite configuration space , and for a given query , an RFCMP problem is one which yields a robustly-feasible trajectory and . Namely, there exists a fixed constant such that it holds that
[TABLE]
One of the primary objectives of this work is to provide asymptotic optimality in the composite configuration space without explicitly constructing a planning structure in this space.
Definition 2 (Asymptotic Optimality)
Let be the time budget of the algorithm and a robustly optimal solution of cost is returned after time , then asymptotic optimality is defined as ensuring that the following holds true for any .
[TABLE]
4 Algorithmic Foundations
This section provides a detailed description of the discrete-Ā () methodĀ (Solovey etĀ al, 2015a), which is the basis of our method presented in SectionĀ 5. Ā was posed as an efficient way to search an implicitly defined tensor-product roadmap, which captures the structure of without explicitly sampling this space.
4.1 Tensor-product roadmap
Here we provide a formal definition of the tensor-product roadmap that Ā is designed to explore. For every robot construct a PRMĀ graphĀ (Kavraki etĀ al, 1996), denoted by , which is embedded in . That is, can be viewed as an approximation of and encodes collision free motions for robot . The construction of is determined by two parameters and , which represent the number of samples, and the connection radius, respectively. As will be discussed in the following sections, it is necessary the roadmaps to be constructed with certain range of parameters to guarantee asymptotic optimality of the new planners (SectionĀ 5).
Define the tensor-product roadmap, denoted by , as the tensor product between (see FigureĀ 2). Each vertex of describes a simultaneous placement of the robots, and similarly an edge of describes a simultaneous motion of the robots. Formally, is the Cartesian product of the nodes from each roadmapĀ . For two vertices , the edge set contains edge if or .111Notice this difference from the original Ā (Solovey etĀ al, 2015a) so as to allow edges where some robots remain motionless. Note that by the definition of , the motion described by each edge represents a path for the robots in which the robots do not collide with obstacles. However, collisions between pairs of robots still may be possible.
It is important to note that the tensor-product roadmap has vertices. Given the neighborhood of a node in as , the size of the neighborhood of a node in is . Using the much smaller to construct online is computationally beneficial.
The presented algorithms share a common set of input and output parameters, such as the configuration space decompositions, which are predefined. In practice, the algorithms use pre-computed roadmaps in each constituent space online. The collision volumes that correspond to the robot and obstacles in the scene are also used online for validation. The algorithms output a trajectory in the configuration space of all robots, which is collision free with all obstacles and among robots.
4.2 Discrete RRT
An explicit construction of is possible in very limited settings that either involve few robots, e.g., , or when the underlying single-robot roadmaps have few vertices and edges. However, in general it is prohibitively costly to fully represent it due to its size, which grows exponentially with the number of robots, in terms of the number of vertices. Moreover, in some cases it may be even a challenge to represent all the edges adjacent to a single vertex of , as there may be exponentially many of those.
The Ā algorithm enjoys the rich structure that offers (see SectionĀ 6) without explicitly representing it. In particular, it gathers information on only from the single-robot roadmaps .
Similarly to the single-robot planner Ā (LaValle and Kuffner, 1999), Ā grows a tree rooted at the start state of the given query (LineĀ 1). Ā restricts the growth of its tree to the tensor-product roadmapĀ in contrast to , which explores the entire space . That is, is a subgraph of , and .
The high level operations of the Ā approach are outlined in AlgorithmĀ 1. The approach will iterate until a solution is found or the time limit is exceeded (AlgorithmĀ 1, LineĀ 2), beginning with performing a fixed number expansion steps at the beginning of each iteration (LinesĀ 3,Ā 4). This expansion process is outlined in AlgorithmĀ 2. The approach then checks to see if there is a connected path to the target (LineĀ 5), and once a path is found, it is returned (LinesĀ 6,Ā 7).
The expansion procedure begins by drawing a random sample (LineĀ 1). It then finds the nearest neighbor in the tree (LineĀ 2) and then selects a neighbor , such that , according to a direction oracle function (LineĀ 3). Then, if is not in the tree (LineĀ 4), it is added to the tree (LineĀ 5) and an edge from to is also added (LineĀ 6).
We now elaborate on . Given , the oracle returns a vertex that is the neighbor of (in ) found in the direction of . The crux of the approach is that can come up with such a neighbor efficiently without relying on explicit representation of . Let and define to be the ray through starting at . Then, denote as the minimum angle between and . Denote by the set of neighbor nodes of in , i.e., for every it holds that . Then
[TABLE]
The implementation of (Algorithm 5) proceeds in the following manner (see a two-robot case illustrated in Figure 3). Let . For every robot , the oracle extracts from the neighbor of , which minimizes the expression . Notice that such a search can be performed efficiently as it only requires to traverse all the neighbors of in . The combination of all yields .
As in , has a Voronoi-bias property Ā (Lindemann and LaValle, 2004). Showing that exhibits Voronoi bias is slightly more involved compared to the basic . This is illustrated in Figure 4. To generate an edge , random sample Ā must be drawn within the Voronoi cell of , denoted as (FigureĀ 4(A)) and in the general direction of , denoted as (FigureĀ 4(B)). The intersection of these two volumes: , is the volume to be sampled so as to generate via Ā as shown in Figure 4.
The high-level loop of the algorithm remains similar across the method variants. The input parameter denotes how many times the tree is expanded before the algorithm checks whether a solution has already been discovered. If , this check is performed every iteration. If tracing the path is an expensive operation - typically it corresponds to a heuristic search process over the tensor product roadmap - then the implementer can choose to use a higher value.
5 Asymptotically Optimal Discrete RRT
This section outlines two versions of the proposed asymptotically optimal variant of the method. The first is a simple uninformed approach, which relies on the fact that to provide asymptotic optimality, it is sufficient to use a simple rewiring scheme. This simplified version will be called the asymptotically-optimal (). For the sake of algorithmic efficiency however, a second, more advanced version is also proposed referred to as . To summarize the algorithmic contributions of the current work over the original :
- ā¢
performs a rewiring step to refine paths in the tree, reducing costs to reach particular nodes.
- ā¢
is anytime, employing branch and bound pruning after an initial solution is reached.
- ā¢
promotes progress towards the goal during tree node selection.
- ā¢
employs an informed expansion procedure capable of using heuristic guidance.
5.1 ao-
This section outlines , an asymptotically optimal version of the algorithm which has been minimally modified to guarantee asymptotic optimality. At a high-level the approach uses a tree re-wiring technique reminiscent of Ā (Karaman and Frazzoli, 2011).
AlgorithmĀ 3 outlines which iteratively expands a tree over given a time budget (AlgorithmĀ 3, LineĀ 2), performing consecutive calls to (LinesĀ 3,Ā 4). Then, the method attempts to trace the path which connects the start with the target (LineĀ 5). If such a path is found and is better than , it replaces (LinesĀ 6,Ā 7). is returned after the time limit is reached (LineĀ 8).
The expansion procedure for is very similar to the original method, and is outlined in AlgorithmĀ 4. It begins by drawing a random sample in the composite configuration space (LineĀ 1), and then finds the nearest neighbor to this sample in the tree (LineĀ 2). It then selects a neighbor according to the oracle function (AlgorithmĀ 5). This is the same oracle that is used in that tries to select a neighbor of most in the direction of . Then, if is not in the tree (LineĀ 4), it is added to the tree (LineĀ 5) and an edge from to is also added (LineĀ 6). Where this expansion step differs is that if is already in the tree (LineĀ 7), the method performs a rewiring step (LineĀ 8) to check to see if the path to is of lower cost than the existing one.
The method would be similar to in terms of the samples that constitute the tree, however improves the solution cost with iterations and finds better solutions compared to . It is however desirable to focus the search in order find the initial solution quickly, while preserving solution quality improvement over time.
5.2
The main body of the informed Ā algorithm is provided in AlgorithmĀ 6. The proposed method is an improvement on top of , that preserves the asymptotic optimality while benefiting computationally from branch-and-bound pruning once a solution is found, greedy child propagations during node selection, and heuristic guidance during expansions.
The key insight behind the algorithmic improvements is the fact that by virtue of the structure of the tensor-product roadmap , there readily exists a usable heuristic measure in the constituent roadmaps . The shortest path on a constituent roadmap to the goal can be used as a heuristic to guide the tree. If there is no robot interaction introduced by the individual robot shortest paths, such a path comprising of the individual shortest paths is a solution that suffices. In cases of interaction between the robots, a shortest path is expected to deviate locally in regions of interaction. The best a robot can do from any constituent roadmap vertex is to follow the shortest path to the goal on the constituent roadmap. Although domain-specific heuristics can be also applied to the algorithm, it should be noted that in the currently proposed method, the purpose of the heuristic is to primarily discover an initial solution as quickly as possible. This in turn helps branch-and-bound kick in and further focuses the search once a bounding cost is ascertained from the initial solution.
At a high level, AlgorithmĀ 6 follows the structure of . The only change is that the outer loop keeps track of the tree node being added as and passes it on to the next call to the subroutine. The use of this information to apply heuristic guidance is detailed in the description of the function.
AlgorithmĀ 7 outlines the expansion step. The default behavior is summarized in AlgorithmĀ 7, LinesĀ 1-3, i.e., when no is passed as argument (LineĀ 1). This operation corresponds to an exploration step similar to , i.e., a random sample is generated in (LineĀ 2) and its nearest neighbor in is found (LineĀ 3). If the last iteration generated a node that was closer to the goal relative to its parent, then is provided to the function. In this case (LineĀ 4-6) is set as the target configuration , and is selected as . This constitutes the greedy child propagations which promotes progress towards the goal.
Informed Expansion : The expansion procedure in AlgorithmĀ 8 replaces the oracle in . It switches between distinct guided and exploratory behaviors according to whether attempts to drive the expansion towards the target or not. When the method uses heuristic guidance, among all the neighbors of on a constituent roadmap , , the one with the best heuristic measure is selected. During the exploration phase, the method selects a random neighbor out of .
In either case, the oracle function returns to AlgorithmĀ 7 the implicit graph node Ā that is a neighbor of Ā on the implicit graph (LineĀ 7). Then the method finds neighbors , which are adjacent to Ā in and have also been added to (LineĀ 8). Among , the best node is chosen as the node to connect according to cost measure. Such an operation might yield no valid parent due to collisions along . In such a case (LineĀ 10) the method fails to add a node during the current iteration. LineĀ 11 implements a branch-and-bound based on the cost of the best solution so far.
LinesĀ 12-15 recount the tree addition and parent rewire process. LinesĀ 16-17 perform an additional rewiring step in the neighborhood if is a better parent of any of the neighboring nodes. Line 19 switches the child promotion by checking whether made progress toward the goal according to the heuristic measure. The method ensures that in this case (or ) is a child-promoted node, which would be selected during the next iteration. This effect of this behavior is that if the uncoordinated individual shortest paths are collision free, this would be greedily attempted first from the child-promoted nodes added to the tree. Evaluations indicate that this proves very effective in practice.
It should be noted that all candidate edges in Line 9 and 17 are collision-free and for the sake of algorithmic clarity, collision checking has been assumed to be encoded into the steering function and this is enforced during tree additions and rewires. Any specialized sampling behavior is assumed to be part of the implementation of the subroutine .
Notes on Implementation: In the implementation, the heuristic measure is efficiently calculated by precomputing all-pair shortest paths on the constituent graphs with Johnsonās algorithm (Johnson, 1977), which runs in . Precomputing the heuristic measure alleviates any overhead of spending online computation time. It is proposed that for large graphs, the vertices can be subsampled and the heuristic estimated for representative nodes that approximate the value in their neighborhoods. The neighbor with the best value can be computed once for a given target , and reused during the iterations inside . In AlgorithmĀ 7 (Line 19), refers to a heuristic estimate in the composite space. This can be deduced from the constituent spaces. The method also included additional focused random sampling (Gammell etĀ al, 2015) once a solution is found to aid in convergence. For a fraction of the ārandom samplesā, goal biasing samples the target state for a robot.
The set returns the set of neighbors on the graph for a vertex, in addition to the vertex itself. This ensures that it is possible for a robot to stay static during an edge expansion. This means that the algorithm is also able to discover solutions where a subset of the robots must remain stationary for a period of time.
6 Analysis
This section examines the properties of Ā starting with the asymptotic convergence of the implicit roadmap to containing a path in with optimum cost. Then, it is shown that the online search eventually discovers the shortest path in . The combination of these two facts proves the asymptotic optimality of and .
For simplicity, the analysis considers robots operating in Euclidean space, i.e., is a -dimensional Euclidean hypercube for fixed . Robots are assumed to have the same number of degrees of freedom . The results can relate to a large class of systems, which are locally Euclidean (see, Dobson and Bekris (2013)). This is applicable to all the systems under consideration in the paper, including manipulators, with bounded angular degrees of freedom. Analysis of systems, which are not locally Euclidean, requires additional rigor especially regarding the definition of the cost metric. The Discussion section includes a description of a possible extension of the presented analysis to non-holonomic systems. It is acknowledged that the arguments presented in the current section will not readily transfer to such systems.
6.1 Optimal Convergence of
In this section we prove that when the connection radius 222In the graphs considered here, an edge exists between two nodes, if the nodes are separated by a distance less than the connection radius . used for the construction of the single-robot PRMĀ roadmaps is chosen in a certain manner, this yields a tensor-product graph , which contains asymptotically optimal paths for MMP.
Definition 3
A trajectory is robust if there exists a fixed such that for every it holds that , where denotes the standard Euclidean distance.
Definition 4
Let cost be one of the cost functions defined in SectionĀ 3. A value is robust if for every fixed , there exists a robust path , such that . The robust optimum , is the infimum over all such values.
For any fixed , and a specific instance of constructed from roadmaps, having samples each, denote by the lowest-cost path (with respect to ) from to overĀ .
Definition 5
is asymptotically optimal (AO) if for every fixed it holds that asymptotically almost surely333Let be random variables in some probability space and let be an event depending onĀ . occurs asymptotically almost surely (a.a.s.) if ., where the probability is over all the instantiations of with samples for each PRM.
Using this definition, the following theorem is proven. Recall that denotes the dimension of a single-robot configuration space.
Theorem 6.1
* is AO when*
[TABLE]
where where is any constant larger than [math], is the volume measure and is the volume of an unit hyperball in .
Since the method deals with solving the problem of finding a robust optimum solution, some is fixed. By definition of the problem, there exists a robust trajectory , and a fixed , such that . Additionally for every it holds that .
If it can be shown that contains a trajectory , such that444The small-o notation indicates a function that becomes smaller than any positive constant and thereby asymptotically will become negligible. When this relation holds, the positive constant corresponds to .:
[TABLE]
a.a.s., this would imply that , proving TheoremĀ 6.1.
As a first step, it will be shown that the robustness of in the composite space implies robustness in the single-robot setting, i.e., robustness along .
For define the forbidden space parameterized by as
[TABLE]
Claim 1
For every robot , , and , , i.e., the robustness of in the composite space implies robustness over all single-robot paths .
Proof
Fix a robot , and fix some and a configuration . Next, define the following composite configuration
[TABLE]
Note that it differs from only in the -th robotās configuration. By the robustness of it follows that
[TABLE]
The result of Claim 1 is that the paths are robust in their individual spaces w.r.t the parameterized forbidden space . This means that there is sufficient clearance for the individual robots to not collide with each other given a fixed location of a single robot.
Next, a Lemma is derived using proof techniques from the literatureĀ (Janson etĀ al, 2015, TheoremĀ 4.1), and it implies every contains a single-robot path that converges to .
Lemma 1
For every robot , let be constructed with samples and a connection radius . Then it contains a path with the following attributes a.a.s.:
- (i)
, ;
- (ii)
;
- (iii)
, s.t. , where Im is the function image.
Proof
The first two properties of Lemma (i) and (ii) restate Ā (Janson etĀ al, 2015, TheoremĀ 4.1), which is applicable to the setup of this work. The last property (iii) is an immediate corollary of the first two: due to the fact that is obtained from , every point along the path is either a vertex of the graph, or lies on a straight-line path (i.e., an edge) between two vertices, whose length is at most .
To complete the proof of TheoremĀ 6.1 it remains to be shown that the combination of yields the trajectory, of a desired cost, i.e., one that conforms to EquationĀ 1. The bound derived in LemmaĀ 1 (ii) looks like what we need for proving TheoremĀ 6.1. Even though a similar bound exists in the individual spaces, it needs to be shown that EquationĀ 1 holds for a cost function in . We proceed to show this individually for the different cost functions.
6.1.1 Optimal Convergence for a Linear combination of Euclidean arc lengths
Lemma 2
Given LemmaĀ 1 (ii), EquationĀ 1 holds for a cost function that is a linear combination of Euclidean arc lengths
Proof
Here consider the case that , which can also be easily modified for or some arbitrary linear combination of the arc lengths.
In particular, define , where are obtained from LemmaĀ 1. Then
[TABLE]
Lemma 3
A path exists, that satisfies the properties of LemmaĀ 1, and is collision free both in terms of robot-obstacle and robot-robot.
Proof
Every constituent roadmap of samples is constructed to satisfy LemmaĀ 1 and contains individual robot paths . defines the tensor-product graph in the composite configuration space . The path is a combination of the individual robot paths . LemmaĀ 1 implies that contains a path in , that represents collision-free motions relative to obstacles, and minimizes the cost function. Nevertheless, it is not clear whether this ensures the existence of a path where robot-robot collisions are avoided. That is, although , it might be the case that . Next, it is shown that can be reparametrized to induce a composite-space path whose image is fully contained in , with length equivalent to .
For each robot , denote by the chain of vertices traversed by . For every assign a timestamp of the closest configuration along , i.e.,
[TABLE]
Also, define and denote by the ordered list of , according to the timestamp values. Now, for every , define a global timestamp function , which assigns to each global timestamp in a single-robot configuration from . It thus specifies in which vertex robot resides at time . For , let be the largest index, such that . Then simply assign . From property (iii) in LemmaĀ 1 and ClaimĀ 1 it follows that no robot-robot collisions are induced by the reparametrization. This concludes the proof of TheoremĀ 6.1.
6.1.2 Optimal Convergence for Euclidean arc length
Arguments for convergence of the cost of the solution in terms of the Euclidean arc length of the composite path can be made to extend the results of LemmaĀ 2. A robot having DoFs, exists in an space. The motion of the robot constitutes a curve in , defined as
[TABLE]
where is the coordinate function of the curve along the DoFĀ , where and . The Euclidean arc-length of the path in :
[TABLE]
The coordinate function is assumed to be continuous with respect to the Lebesgue measure on and of bounded variation. The Lebesgue measure assigns a measure to subsets of -dimensional Euclidean space. For or , it coincides with the standard measure of length, area, or volume. The assumption here is that the curve is Lebesgue integrable over the coordinate function. This relates to the variation of the curve being smooth for the parametrization, over subsets of the curve that correspond to the Lebesgue measure over . is also a rectifiable curve, i.e., the curve has finite length.
Definition 6
The partial arc length is defined as,
[TABLE]
By definition and assuming smoothness and bounded variation (Pelling, 1977), for to
[TABLE]
The value is the supremum over all possible finite partitions , that can divide . This generates a finite set of parts.
We denote the value of for some partition as:
[TABLE]
A part shall refer to the curve between and . The measure of the part would be
[TABLE]
Let be the finite supremum partitioning over , that has parts. This means, . Without loss of generality, let us assume that corresponds to the supremum partitioning that has the least number of parts, , i.e., there are no degenerate partitions. A finer partition can introduce additional parameterization over , and hence is a superset of , but cannot increase the value of since is the supremum. Note that a partition sequence with parts will have a cardinality of . The finer partition has all these parametrization values in addition to others.. Since is finite and is finite, .
Claim 2
Given a finite set of paths , there exists a finer partitioning, for over each , which yields number of equal measure parts (Figure 6) for every .
Proof
[TABLE]
This holds true for every for a corresponding . The measure of every part is equal, and is denoted by . This simplifies EquationĀ 5 to .
Claim 3
Additionally, by this simplification, EquationĀ 5 in the composite space is restated for where .
Proof
In the multi-robot space Euclidean space , the arc length in the composite space can be expressed in terms of the arc lengths traversed in the individual robot spaces.
[TABLE]
where, with a slight abuse of notation is a shorthand representation for some .
Lemma 4
For a , where is obtained from LemmaĀ 1, given that , EquationĀ 1 holds for the Euclidean arc lengths.
Proof
Partitioning the arcs , and , into (chosen as per ClaimĀ 2) pieces of equal length, yields two trajectory sequences, for .
The high level idea is that leveraging the uniformity in the parameterized parts introduced by , LemmaĀ 1(ii) has to be recombined to represent the Euclidean arc length in the composite space.
[TABLE]
Following the same parameterization described in LemmaĀ 3, TheoremĀ 6.1 can be shown for the Euclidean metric as well.
6.2 Asymptotic Optimality of
Finally, Ā is shown to be AO. Denote by the time budget in AlgorithmĀ 6, i.e., the number of iterations of the loop. Denote by the solution returned by for samples in the individual constituent roadmaps and iterations of the Ā algorithm.
Theorem 6.2
If then for every fixed it holds that
[TABLE]
Since is AO (TheoremĀ 6.1), it suffices to show that for any fixed , and a fixed instance of , defined over PRMs with samples each, eventually (as tends to infinity), finds the optimal trajectory over . This property is stated in LemmaĀ 5 and proven subsequently. The same arguments hold for both and , with the difference highlighted explicitly in the proof.
Lemma 5 (Optimal Tree Convergence of )
Consider an arbitrary optimal path originating from and ending at , then let be the event such that after iterations of , the search tree contains the optimal path up to segment . Then,
[TABLE]
Proof
This is shown using Markov chain results (Grinstead and Snell, 2012, TheoremĀ 11.3). Specifically, absorbing Markov chains can be leveraged to show that will eventually contain the optimal path over . An absorbing Markov chain has some subset of its states in which the transition matrix only allows self-transitions.
The proof follows by showing that the method can be described as an absorbing Markov chain, where the target state of a query is represented as an absorbing state in a Markov chain. For completeness, the theorem is re-stated here.
Theorem 6.3 (Thm 11.3 in Grinstead & Snell)
In an absorbing Markov chain, the probability that the process will be absorbed is 1 (i.e., as ), where is the transition submatrix for all non-absorbing states.
The first part is that the search is cast as an absorbing Markov chain, and second, that the transition probability from each state to the next is nonzero, i.e., each state eventually connects to the target.
For query , let the sequence of length represent the vertices of corresponding to the optimal path through the graph which connects these points, where corresponds to the target vertex, and furthermore, let be an absorbing state. TheoremĀ 6.3 works under the assumption that each vertex is connected to an absorbing state .
Then, let the transition probability for each state have two values, one for each state transitioning to itself, which corresponds to the search expanding along some other arbitrary path. The other value is a transition probability from to . This corresponds to two slightly different cases for and .
Case : The transition probability from to corresponds to the method sampling within the volume . Then, as the second step, it must be shown that this volume has a positive probability of being sampled in each iteration. It is sufficient then to argue that . Fortunately, for any finite , previous work has already shown that this is the case given general position assumptions (Solovey etĀ al, 2015a, LemmaĀ 2).
Case : In the case of due to the random neighborhood selection in the expansion , there is a positive transition probability from to .
Given these results, the is cast as an absorbing Markov chain, which satisfies the assumptions of Theorem 6.3, and therefore, the matrix . This implies that the optimal path to the goal has been expanded in the tree, and therefore \liminf_{m\to\infty}\mathbb{P}\big{(}O^{(m)}_{t}\big{)}=1.
7 Extension to Shared Degrees of Freedom
This section describes an extension of the approach to systems with shared degrees of freedom (DoF), with specific focus on humanoid robots with two arms. The challenge here arises because of the high dimensionality of the robots. The shared DoFĀ is a general formulation, which can refer to either degrees of freedom in a torso or a mobile base etc.
This section is structured in the same way as the rest of the algorithmic descriptions, and a lot of the shared notations and details are omitted for the sake of brevity. Instead, the interesting insights into the problems that arise due to the shared DoFĀ are highlighted, and resolved. A high level overview of the differences of dual-arm () from the previously stated methods includes:
- ā¢
decomposes the space by grouping the shared DoFĀ with one of the arms.
- ā¢
implicitly builds two trees online that explores two tensor roadmaps.
- ā¢
needs additional arguments for proving robustness in Claim 1.
The current work does not get into aspects related to manipulation. Nevertheless, the primitives designed here can speed up dual-arm manipulation task planning, where computational benefits can be achieved by operating over multiple roadmaps (Gravot et al, 2002; Gravot and Alami, 2003b). The topology of dual-arm manipulation has been formalized (Koga and Latombe, 1994; Harada et al, 2014) and extended to the -arm case (Dobson and Bekris, 2015). It requires the consideration of multi-robot grasp planning (Vahrenkamp et al, 2010; Dogar et al, 2015), regrasping (Vahrenkamp et al, 2009), as well as closed kinematic chain constraints (Cortés and Siméon, 2005; Bonilla et al, 2017). Furthermore, force control strategies are helpful for multi-arm manipulation of a common object (Caccavale and Uchiyama, 2008). Recently coordinated control has been applied to solve human-robot interaction tasks (Sina Mirrazavi Salehian et al, 2016).
The algorithm is meant to address the applicability of to high dimensional humanoid robots with shared DoF.
7.1 Problem Setup and Notation
As shown in Fig. 7, the DoF can be grouped into left, right, and shared DoFĀ subsets, so that: . A candidate solution path can be decomposed to projections along , and respectively.
The method proposes the construction of the following roadmaps, as shown in Fig. 8:
- ā¢
A left-shared and a right-shared DoFĀ roadmap , where and . The edges are collision-free paths in the same spaces, i.e., no collisions with the static geometry, or self-collisions among the arm or the shared DoFs.
- ā¢
A left arm and a right arm roadmap , such that , and . These roadmaps do not consider the static geometry as they are not grounded by the shared DoFs. So, only self-collisions between arm links are avoided.
The method focuses on two tensor product roadmaps: , and . The method then simultaneously searches over and in a -esque fashion.
7.2 Methodology
This section describes the proposed method, and the way the builds a forest of two trees , which explores both Ā and . In terms of the methodās properties it is sufficient to consider only one roadmap, but in practice, exploring them simultaneously helps in the convergence, since we can evaluate more possible solutions and rewires. The approach shows faster convergence compared to in , and scales more than .
At a high-level, the proposed Dual-arm Ā () simultaneously explores the tensor product roadmaps Ā and , by building a search tree for each one so as to find a solution from the start configuration to the target configuration . For every vertex, the algorithm keeps track from which tensor product roadmap the vertex belongs to. Upon initialization, the tree starts with two vertices, and , one corresponding to tensor product roadmap and the other to . Then, at every iteration, the tree data structure is expanded by adding a new edge and a node by calling an expand subroutine like Algorithm 7. The differences arises in the neighborhood calculation in Algorithm 7 LineĀ 8. The neighborhood for considers the tensor roadmap neighborhoods that are part of the tree for both roadmaps. belongs to to either or . is chosen to be the nearest tree vertex that was generated on the other tensor roadmap. is the set of all tree vertices that are tensor roadmap neighbors of or . While doing rewires, care is taken to only rewire nodes belonging to the same tensor roadmap. The consideration of a richer neighborhood lets the algorithm ensure adequate exploration of both tensor roadmaps. The informed oracle is similar to Algorithm 8 with the difference arising for the constituent roadmap , where the estimate is simply the shortest Euclidean distance to the goal.
Notes on Efficiency: The difference of the decomposition for shared degrees of freedom compared to is that does not give two kinematically independent spaces. Specifically, depends on the shared to be grounded to the frame of the robot. This means that the heuristic is less informed for and can only use the straight line distance. The algorithm does not work out of the box in the case of robots with shared degrees of freedom. The effect of the less expressive heuristic in , translates into some degradation in performance relative to the case of two kinematically independent robotic arms. Nevertheless, Ā is still significantly faster than operating directly in the composite space of the entire robot. There are not many methods that can practically compute solutions for such high-dimensional (e.g., 15 degrees of freedom) systems with kinematic dependences. The proposed Ā method preserves some of the scalability benefits of and addresses the kinematic dependence that arises for many popular humanoid robots.
7.3 Analysis
**Asymptotic optimality of tensor roadmaps : ** Given the decomposition, is divided into two parts: and . If a robust optimal path exists in , most of the arguments of Section 6 still hold for this decomposition. Due to the nature of the space decomposition, since the constituent spaces do not correspond to kinematically independent robots, the clearance assumption in Claim 1 needs to be reworked.
Claim 4
Robustness in implies robustness in and . For every decomposition, , and , .
Proof
Consider any , where is a configuration in so that the right arm collides either with the static geometry or with the left-shared part of the robot, which is at . Given a robust , is a colliding configuration: . But and only differ in , so the path has clearance
[TABLE]
By switching the decomposition of in into , by the above reasoning:
[TABLE]
This proves the robustness for . The same reasoning can be applied to and .
It suffices to follow the proof structures outlined in Section 6 to argue asymptotic optimality for the method. It should be noted that due to the coupled nature of introduced by the shared , the use of the Euclidean cost metric is more applicable.
8 Experimental Validation
This section provides an experimental evaluation of Ā by demonstrating practical convergence, scalability for disk robots, and applicability to dual-arm manipulation. The choice of a cost metric depends on the type of application and the underlying system properties. For systems without shared degrees of freedom, the considered cost function is the sum of individual Euclidean arc lengths, which is a popular choice for multi-robot systems. For systems with shared degrees of freedom, the combined nature of the underlying configuration space motivates the use of Euclidean arc length in the composite space as the metric. The results show that the properties and benefits of the proposed algorithms stay robust for both choices of cost functions.
8.1 Tests on Systems without Shared DoF
The approach and alternatives are executed on a cluster with Intel(R) Xeon(R) CPU E5-4650 @ 2.70GHz processors, and 128GB of RAM. The solution costs are evaluated in terms of the sum of Euclidean arc lengths.
2 Disk Robots among 2D Polygons: This base-case test involves disks () of radius with bounded velocity, in a region, inflated by the radius, as in FigureĀ 9. The disks have to swap positions between and . This is a setup where it is possible to compute the explicit roadmap, which is not practical in more involved scenarios. In particular, Ā is tested against: a) running on the implicit tensor roadmap (referred to as āImplicit ā), where is defined over the same individual roadmaps with nodes as those used by ; b) an explicitly constructed Ā roadmap with nodes in ; and c) the variant of the algorithm.
Results are shown in FigureĀ 10. Ā converges to the optimal path over , similar to the one discovered by Implicit , while quickly finding an initial solution of high quality. Furthermore, the implicit tensor product roadmap is of comparable quality to the explicitly constructed roadmap. The convergence of is faster compared to corresponding variant as evident from FigureĀ 10(left).
TableĀ 1 presents running times. Ā and implicit Ā construct -sized roadmaps (rowĀ 3), which are faster to construct than the Ā roadmap in (rowĀ 1). Ā becomes very costly as increases. For , the explicit roadmap contains vertices, taking GB of RAM to store, which was the upper limit for the machine used. When the roadmap can be constructed, it is fast to query (rowĀ 2). Ā quickly returns an initial solution (rowĀ 5), at par with the solution times from the explicit roadmap and well before Implicit Ā returns a solution (rowĀ 4). The initial solution times are compared visually in FigureĀ 10 which demonstrates the efficiency of compared to as well. The next benchmark further emphasizes this point.
The comparison between the early solution time required to find a suboptimal solution by the proposed method against the computation time needed by the optimal Ā highlights the impact of roadmaps of increasing sizes. While ās initial solution times barely change, the time taken by any variant of heuristic search over the composite roadmap increases with the size of the roadmap. This indicates that roadmaps of size similar to the tensor roadmaps considered here would rapidly cease to be solvable without anytime performance similar to that of .
Many Disk Robots among 2D Polygons: In the same environment as above, the number of robots is increased to evaluate scalability. The same environment is maintained in this benchmark to introduce additional complexity purely in terms of the addition of more robots into the planning problem. The effect of more difficult and practical planning scenarios would be explored in the subsequent benchmarks with manipulators. Each robot starts on the perimeter of the environment and is tasked with reaching the opposite side. An roadmap is constructed for every robot. It quickly becomes intractable to construct a Ā roadmap in the composite space of many robots.
FigureĀ 11 shows the inability of alternatives to compete with Ā in scalability. Solution costs are normalized by an optimistic estimate of the path cost for each case, which is the sum of the optimal solutions for each robot, disregarding robot-robot interactions. The colored vertical bars represent the range of the average initial and final solution costs. Implicit Ā fails to return solutions even for 3 robots. Directly executing Ā in the composite space fails to do so for . The original Ā method (without the informed search component) starts suffering in success ratio for and returns worse quality solutions than . The variant performs similar to in terms of success ratio but expectedly finds better solutions than . finds solutions up to .
In order to give an estimate of the immensity of the size of the search space, for , the tensor-product roadmap represents an implicit structure consisting of or million-billion vertices.
Dual-arm manipulator: This test (FigureĀ 12) shows the benefits of Ā when planning for two -dimensional arms. FigureĀ 13 shows that Ā fails to return solutions within iterations. Using small roadmaps is also insufficient for this problem. Both Ā and Implicit Ā require larger roadmaps to begin succeeding. But with , Implicit Ā always fails, while Ā maintains a success ratio. As expected, roadmaps of increasing size result in higher quality path. The informed nature of Ā also allows to find initial solutions fast, which together with the branch-and-bound primitive allows for good convergence. The initial solution times in FigureĀ 13 indicate that the heuristic guidance succeeds in finding fast initial solutions even for larger roadmaps.
8.2 Tests on Systems with Shared DoF
This section showcases three benchmarks of increasing difficulty, which are used to evaluate the performance of the . All the experiments were run on a cluster with Intel(R) Xeon(R) CPU E5-4650 @ 2.70GHz processors, and 128GB of RAM. In each benchmark, different sizes of the constituent roadmaps and were evaluated. The algorithm is compared against and . The platforms used are SDA10F, with a torsional DoF, and on a mobile base that can rotate and translate. For the algorithm and all benchmarks, randomly seeded roadmaps with nodes are constructed in and data are gathered from experiments. A node roadmap has million edges, and takes hours to construct in these high dimensional spaces. Larger roadmaps run into memory scalability issues. These roadmaps in the full space occupied MB. In comparison, the space requirement for two arm roadmaps were MB.
For all benchmarks, both and were allowed to run for iterations. is ran in different randomly seeded experiments for every benchmark. For the Ā algorithm, experiments are run for every benchmark, for the different constituent roadmap sizes , by building pairs of randomly seeded constituent roadmaps, and running randomly seeded experiments over each roadmap combination.
Motoman Tabletop Benchmark: A set of random collision-free starts and goals are selected in the tabletop environment, shown in Fig. 14.
They are only used if they are sufficiently far away from each other. is tested with constituent roadmap sizes of and . All the algorithms succeed in every experiment. In this simpler problem, smaller roadmaps are quicker to search, and generate initial solutions faster compared to , as shown in Fig. 15 (top).
Searching the is the fastest (online), but the solution quality is worse than that obtained from the other methods. Ā converges to better solutions, compared to the other algorithms, as shown in Fig 15 (bottom).
MotomanĀ ShelfĀ Benchmark: This benchmark sets up the in front of shelves. The robot has to plan between two states where both arms are inside different shelving units, which require the rotation of its torso (Fig. 16 (top)).
This is a significantly harder problem, and suffers in terms of success ratio (Fig. 16 (second)). takes much longer to find the initial solution, as indicated by Fig. 16 (middle). is still the fastest in finding solutions (only online cost considered again here). The solution cost is much better than both the average solution, and , as shown in Fig. 16 (bottom). Ā will quickly converge for smaller roadmaps, and then stop improving the cost. The larger roadmaps contain better solutions, causing to converge slower.
Mobile Baxter Benchmark: This benchmark uses a Rethink robot with a mobile base. The robot is grasping two long objects inside a shelf Fig. 17 (top). The robot has to navigate across a cramped, walled room, to a placing configuration inside a shelf on the other side of the room.
This proves to be the most challenging problem among the three benchmarks. As shown in Fig. 17 (middle), fails to find a solution. It should be noted that, when tested on a simpler version of the benchmark without the pillar in the room, could find solutions. also falters by showing a very low success rate. This indicates that we need even larger roadmaps in to solve harder problems. The problem is solved when a dense implicit structure, with is explored by .
Fig.Ā 17 (bottom) shows that Ā finds better initial and converged solutions when compared to the instances in which succeeded.
8.3 Real world experiments
Experiments were performed in a 28-dimensional space with two dual-armed manipulators: (a SDA10f and a ). Initial solutions were obtained in a fraction of a second for the two experimental setups, with the method allowed to run for iterations to improve the quality of the demonstrated trajectories. The two setups are chosen carefully to demonstrate in the first instance a typical application of simultaneous grasping that may arise in real world scenarios, and in the second instance a problem that forces very close interactions between the arms in close proximity.
Pre-grasp Demonstration: As shown in Figure 18, the demonstration simulates an application to multi-arm manipulation, where the goals of the motion planning problem for 4 arms is to pre-grasping configurations for 4 objects placed on a table in the shared workspace between the robots. node roadmaps were constructed for each arm and was used to search for a solution to the motion planning problem. The solution was computed offline and an open-loop execution was performed on the real system.
Coupled Workspace Demonstration: As shown in Figure 19, a pole is positioned between the two robots so that the arms cannot cross over. The objective is for the 4 arms to a) approach the pole at alternating heights, b) then swap the height of their approaching configurations, and c) finally return back to the start state. node roadmaps were constructed for each arm and was used to search for a solution to the three motion planning problems. The solutions that were computed offline, were stitched together and replayed in an open loop execution on the real system.
9 Discussion
This work proves asymptotic optimality of sampling-based multi-robot planning over implicit structures by extending the Ā approach. Asymptotic optimality is achieved by making a modification, resulting in Ā which expands a spanning tree over an implicitly defined tensor product roadmap, and leverages a simple re-wiring scheme.
This method already has the advantage of avoiding the construction of a large, dense roadmap in the composite configuration space of many robots. This can be further improved to use heuristics so as to search in an informed manner, in the Ā method. The method is also extended to work with robot systems, which share degrees of freedom, resulting in .
Experimental results show the efficacy of the proposed approaches. Furthermore, by leveraging heuristics, Ā is able to solve more challenging problem instances than the baseline Ā method, and the approach is demonstrated to solve complex, real-world problems with robot manipulators operating in a shared workspace with a high degree of coupling.
In terms of practical applicability, Ā promises fast initial solutions times (Figures 11 and 13) on the order of a fraction of a second for most problems, including for high-dimensional, kinematically independent multi-robot problems, which is an exciting result. The solution quality improvement indicates the anytime properties of the approach, where paths of improved path quality are discovered as more computation time is invested. While problems with shared degrees of freedom provide less guidance and result in performance degradation, the scalability benefits remain even in this case relative to composite planning. Future work includes the consideration of dynamics. The existing theoretical analysis of Ā assumes that the individual robot systems are holonomic, which guarantees the existence of near-optimal single-robot paths (see LemmaĀ 1 andĀ (Janson etĀ al, 2015, TheoremĀ 4.1)). Recent results concerning asymptotic optimality of PRMĀ for non-holonomic systems (seeĀ Schmerling etĀ al (2015a, b)) bring the hope of achieving a more general analysis of the current work as well. The proposed framework can also be leveraged toward efficiently solving simultaneous task and motion planning for many robot manipulators (Dobson and Bekris, 2015). The demonstrated applications to manipulators also motivate dual-arm rearrangement challengesĀ (Shome etĀ al, 2018).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Adler et al (2015) Adler A, De Berg M, Halperin D, Solovey K (2015) Efficient Multi-Robot Motion Planning for Unlabeled Discs in Simple Polygons. In: IEEE Transactions on Automation Science and Engineering, vol 12, Springer, pp 1309ā1317, DOI 10.1109/TASE.2015.2470096 , 1312.1038
- 2Atias et al (2017) Atias A, Solovey K, Halperin D (2017) Effective Metrics for Multi-Robot Motion-Planning. In: Proceedings of Robotics: Science and Systems, Cambridge, Massachusetts, DOI 10.1177/0278364918784660 , 1705.10300
- 3Bonilla et al (2017) Bonilla M, Pallottino L, Bicchi A (2017) Noninteracting constrained motion planning and control for robot manipulators. In: IEEE International Conference on Robotics and Automation, pp 4038ā4043, DOI 10.1109/ICRA.2017.7989463
- 4Caccavale and Uchiyama (2008) Caccavale F, Uchiyama M (2008) Cooperative Manipulators. In: Siciliano B, Khatib O (eds) Springer Handbook of Robotics, Springer, pp 701ā718, DOI 10.1007/978-3-540-30301-5Ė30
- 5Canny (1988) Canny JF (1988) The Complexity of Robot Motion Planning, vol Doctoral D. MIT press, DOI 10.1016/j.scriptamat.2009.11.029
- 6CortĆ©s and SimĆ©on (2005) CortĆ©s J, SimĆ©on T (2005) Sampling-Based Motion Planning under Kinematic Loop-Closure Constraints. In: Workshop on the Algorithmic Foundations of Robotics, pp 75ā90, DOI 10.1007/10991541Ė7
- 7Dobson and Bekris (2013) Dobson A, Bekris KE (2013) A study on the finite-time near-optimality properties of sampling-based motion planners. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp 1236ā1241, DOI 10.1109/IROS.2013.6696508
- 8Dobson and Bekris (2015) Dobson A, Bekris KE (2015) Planning representations and algorithms for prehensile multi-arm manipulation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, vol 2015-Decem, pp 6381ā6386, DOI 10.1109/IROS.2015.7354289
