Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Julen Urain, Anqi Li, Puze Liu, Carlo D'Eramo, Jan Peters

TL;DR
This paper introduces Composable Energy Policies (CEP), a modular framework for reactive motion generation that optimizes over the product of policies to handle conflicts and integrates reinforcement learning with flexible priors.
Contribution
The paper proposes CEP, a novel approach that combines multiple policies through optimization over their product, enabling conflict resolution and hierarchical reinforcement learning integration.
Findings
CEP effectively resolves conflicting behaviors in reactive motion generation.
CEP naturally integrates diverse priors into reinforcement learning.
Experimental results demonstrate improved motion control performance.
Abstract
Reactive motion generation problems are usually solved by computing actions as a sum of policies. However, these policies are independent of each other and thus, they can have conflicting behaviors when summing their contributions together. We introduce Composable Energy Policies (CEP), a novel framework for modular reactive motion generation. CEP computes the control action by optimization over the product of a set of stochastic policies. This product of policies will provide a high probability to those actions that satisfy all the components and low probability to the others. Optimizing over the product of the policies avoids the detrimental effect of conflicting behaviors between policies choosing an action that satisfies all the objectives. Besides, we show that CEP naturally adapts to the Reinforcement Learning problem allowing us to integrate, in a hierarchical fashion, any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Reinforcement Learning in Robotics
