GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning
Yingbo Luo, Meibao Yao, Xueming Xiao

TL;DR
GCNT introduces a graph-based transformer policy network that enables morphology-agnostic reinforcement learning, allowing robots with diverse structures to learn resilient locomotion behaviors with zero-shot generalization.
Contribution
It proposes a novel GCNT architecture combining GCN and Transformer to extract and utilize morphological information for universal robot control.
Findings
Achieved state-of-the-art performance on 8 benchmark tasks.
Demonstrated zero-shot generalization to unseen robot morphologies.
Generated resilient locomotion across diverse robot configurations.
Abstract
Training a universal controller for robots with different morphologies is a promising research trend, since it can significantly enhance the robustness and resilience of the robotic system. However, diverse morphologies can yield different dimensions of state space and action space, making it difficult to comply with traditional policy networks. Existing methods address this issue by modularizing the robot configuration, while do not adequately extract and utilize the overall morphological information, which has been proven crucial for training a universal controller. To this end, we propose GCNT, a morphology-agnostic policy network based on improved Graph Convolutional Network (GCN) and Transformer. It exploits the fact that GCN and Transformer can handle arbitrary number of modules to achieve compatibility with diverse morphologies. Our key insight is that the GCN is able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Evolutionary Algorithms and Applications
