Learning Modular Robot Control Policies
Julian Whitman, Matthew Travers, and Howie Choset

TL;DR
This paper introduces a modular control policy framework for reconfigurable robots, enabling a single policy to adapt to many designs through a design-conditioned neural network structure and model-based reinforcement learning.
Contribution
The authors propose a novel design graph-based neural policy that generalizes across multiple robot configurations using shared parameters and a unified training process.
Findings
Policy generalizes to unseen robot designs
Effective control demonstrated on simulated and real robots
Model-based RL enables scalable training across designs
Abstract
Modular robots can be rearranged into a new design, perhaps each day, to handle a wide variety of tasks by forming a customized robot for each new task. However, reconfiguring just the mechanism is not sufficient: each design also requires its own unique control policy. One could craft a policy from scratch for each new design, but such an approach is not scalable, especially given the large number of designs that can be generated from even a small set of modules. Instead, we create a modular policy framework where the policy structure is conditioned on the hardware arrangement, and use just one training process to create a policy that controls a wide variety of designs. Our approach leverages the fact that the kinematics of a modular robot can be represented as a design graph, with nodes as modules and edges as connections between them. Given a robot, its design graph is used to create…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · 3D Printing in Biomedical Research · Innovations in Concrete and Construction Materials
