AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
Brandon Trabucco, Mariano Phielipp, Glen Berseth

TL;DR
This paper introduces AnyMorph, a reinforcement learning method that learns to generalize policies to new agent morphologies without prior descriptions, enabling zero-shot transfer across diverse agents.
Contribution
We propose a data-driven approach that learns morphology representations directly from the RL objective, eliminating the need for hand-designed agent descriptions.
Findings
Improves zero-shot generalization to unseen agent morphologies.
Achieves state-of-the-art performance on agent-agnostic control benchmarks.
Operates without explicit morphological descriptions.
Abstract
The prototypical approach to reinforcement learning involves training policies tailored to a particular agent from scratch for every new morphology. Recent work aims to eliminate the re-training of policies by investigating whether a morphology-agnostic policy, trained on a diverse set of agents with similar task objectives, can be transferred to new agents with unseen morphologies without re-training. This is a challenging problem that required previous approaches to use hand-designed descriptions of the new agent's morphology. Instead of hand-designing this description, we propose a data-driven method that learns a representation of morphology directly from the reinforcement learning objective. Ours is the first reinforcement learning algorithm that can train a policy to generalize to new agent morphologies without requiring a description of the agent's morphology in advance. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
