Search Methods for Policy Decompositions

Ashwin Khadke; Hartmut Geyer

arXiv:2203.15200 · cs.RO · March 30, 2022 · 1 citation

Open Access

TL;DR

This paper investigates two search methods, Genetic Algorithm and Monte-Carlo Tree Search, for finding effective Policy Decomposition strategies for complex control problems. The goal is to keep policy computation tractable while limiting the loss in policy quality.

Contribution

The paper applies search algorithms to the choice of system decomposition in control policy design, addressing the combinatorial growth in the number of candidate decompositions.

Findings

01. Successfully identified decompositions for a 4-DOF manipulator

02. Achieved balance control for a simplified biped

03. Demonstrated hover control for a quadcopter

Abstract

Computing optimal control policies for complex dynamical systems requires approximation methods to remain computationally tractable. Several approximation methods have been developed to tackle this problem. However, these methods do not reason about the suboptimality that the approximations induce in the resulting control policies. In our earlier work we introduced Policy Decomposition, an approximation method that provides a suboptimality estimate. Policy Decomposition proposes strategies to break an optimal control problem into lower-dimensional subproblems, whose optimal solutions are combined to build a control policy for the original system. However, the number of possible strategies to decompose a system scales quickly with the complexity of the system, posing a combinatorial challenge. In this work we investigate the use of Genetic Algorithm and Monte-Carlo Tree Search to…
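
The combinatorial challenge mentioned above can be made concrete with a rough count. The sketch below is only an illustration and not the counting formula used in the paper: it assumes, as a simplification, that a decomposition at minimum groups the system's control inputs into disjoint non-empty subsets, and it ignores how state variables are assigned or how subproblems are ordered. Under that assumption, the number of groupings alone is the Bell number, which already grows quickly. The function name `bell_number` is hypothetical.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def bell_number(n: int) -> int:
    """Count the ways to split n labeled items into non-empty, unordered groups."""
    if n == 0:
        return 1
    # Standard recurrence: B(n) = sum over k of C(n-1, k) * B(k), for k = 0..n-1
    return sum(comb(n - 1, k) * bell_number(k) for k in range(n))


if __name__ == "__main__":
    # Illustrative assumption: each item is one control input of the system.
    for num_inputs in range(1, 9):
        print(f"{num_inputs} inputs -> {bell_number(num_inputs)} input groupings")
```

Because this count leaves out the other choices a decomposition involves, it should be read as a lower bound on the size of the search space under the stated assumption, which helps explain why exhaustive enumeration gives way to search methods.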

Taxonomy

Topics: Reinforcement Learning in Robotics · Formal Methods in Verification · Robotic Path Planning Algorithms