Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
Ye Yuan, Yuda Song, Zhengyi Luo, Wen Sun, Kris Kitani

TL;DR
Transform2Act introduces a novel policy that simultaneously optimizes agent design and control by applying sequential transformations and message-passing graph neural networks, significantly improving efficiency and performance.
Contribution
It proposes a transform-and-control policy framework that integrates agent design modification into decision-making, enabling joint optimization and better sample efficiency.
Findings
Outperforms prior methods in convergence speed and final performance
Automatically discovers biologically plausible agent designs
Uses message passing to handle variable joint configurations
Abstract
An agent's functionality is largely determined by its design, i.e., skeletal structure and joint attributes (e.g., length, size, strength). However, finding the optimal agent design for a given function is extremely challenging since the problem is inherently combinatorial and the design space is prohibitively large. Additionally, it can be costly to evaluate each candidate design which requires solving for its optimal controller. To tackle these problems, our key idea is to incorporate the design procedure of an agent into its decision-making process. Specifically, we learn a conditional policy that, in an episode, first applies a sequence of transform actions to modify an agent's skeletal structure and joint attributes, and then applies control actions under the new design. To handle a variable number of joints across designs, we use a graph-based policy where each graph node…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Machine Learning and Data Classification
