Search Methods for Policy Decompositions
Ashwin Khadke, Hartmut Geyer

TL;DR
This paper explores advanced search methods like Genetic Algorithms and Monte-Carlo Tree Search to efficiently identify effective system decompositions for complex control problems, improving computational tractability and policy quality.
Contribution
It introduces the application of search algorithms to optimize system decompositions in control policy design, addressing combinatorial challenges in policy decomposition.
Findings
Successfully identified decompositions for a 4-DOF manipulator
Achieved balance control for a simplified biped
Demonstrated hover control for a quadcopter
Abstract
Computing optimal control policies for complex dynamical systems requires approximation methods to remain computationally tractable. Several approximation methods have been developed to tackle this problem. However, these methods do not reason about the suboptimality induced in the resulting control policies due to these approximations. We introduced Policy Decomposition, an approximation method that provides a suboptimality estimate, in our earlier work. Policy decomposition proposes strategies to break an optimal control problem into lower-dimensional subproblems, whose optimal solutions are combined to build a control policy for the original system. However, the number of possible strategies to decompose a system scale quickly with the complexity of a system, posing a combinatorial challenge. In this work we investigate the use of Genetic Algorithm and Monte-Carlo Tree Search to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Formal Methods in Verification · Robotic Path Planning Algorithms
