Learning Modular Robot Locomotion from Demonstrations
Julian Whitman, Howie Choset

TL;DR
This paper introduces a modular, graph neural network-based approach that uses demonstrations from certain robot designs to efficiently learn controllers for new designs, reducing training time.
Contribution
It presents a novel combined reinforcement and imitation learning method for modular robots, enabling cross-design policy transfer within a unified framework.
Findings
Demonstrations from one set of designs help learn controllers for new designs.
The combined learning approach reduces training iterations needed for new designs.
Modular policies generalize across different robot configurations.
Abstract
Modular robots can be reconfigured to create a variety of designs from a small set of components. But constructing a robot's hardware on its own is not enough -- each robot needs a controller. One could create controllers for some designs individually, but developing policies for additional designs can be time consuming. This work presents a method that uses demonstrations from one set of designs to accelerate policy learning for additional designs. We leverage a learning framework in which a graph neural network is made up of modular components, each component corresponds to a type of module (e.g., a leg, wheel, or body) and these components can be recombined to learn from multiple designs at once. In this paper we develop a combined reinforcement and imitation learning algorithm. Our method is novel because the policy is optimized to both maximize a reward for one design, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · 3D Printing in Biomedical Research
MethodsGraph Neural Network
