Transform2Act: Learning a Transform-and-Control Policy for Efficient   Agent Design

Ye Yuan; Yuda Song; Zhengyi Luo; Wen Sun; Kris Kitani

arXiv:2110.03659·cs.LG·April 12, 2022·5 cites

Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design

Ye Yuan, Yuda Song, Zhengyi Luo, Wen Sun, Kris Kitani

PDF

Open Access 1 Repo 1 Video

TL;DR

Transform2Act introduces a novel policy that simultaneously optimizes agent design and control by applying sequential transformations and message-passing graph neural networks, significantly improving efficiency and performance.

Contribution

It proposes a transform-and-control policy framework that integrates agent design modification into decision-making, enabling joint optimization and better sample efficiency.

Findings

01

Outperforms prior methods in convergence speed and final performance

02

Automatically discovers biologically plausible agent designs

03

Uses message passing to handle variable joint configurations

Abstract

An agent's functionality is largely determined by its design, i.e., skeletal structure and joint attributes (e.g., length, size, strength). However, finding the optimal agent design for a given function is extremely challenging since the problem is inherently combinatorial and the design space is prohibitively large. Additionally, it can be costly to evaluate each candidate design which requires solving for its optimal controller. To tackle these problems, our key idea is to incorporate the design procedure of an agent into its decision-making process. Specifically, we learn a conditional policy that, in an episode, first applies a sequence of transform actions to modify an agent's skeletal structure and joint attributes, and then applies control actions under the new design. To handle a variable number of joints across designs, we use a graph-based policy where each graph node…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Khrylx/Transform2Act
pytorchOfficial

Videos

Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Machine Learning and Data Classification