Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning

Charles Schaff; David Yunis; Ayan Chakrabarti; Matthew R. Walter

arXiv:1801.01432·cs.RO·September 18, 2018

TL;DR

This paper introduces a method that jointly optimizes a robot's physical design and its control policy using deep reinforcement learning, so that a separate policy does not have to be trained for every candidate design.

Contribution

The approach optimizes the robot design and the control policy simultaneously, which avoids the cost of training a separate policy for each design.

Findings

01. Discovered novel robot designs and gaits.
02. Outperformed baseline methods.
03. Achieved a more efficient joint optimization process.

Abstract

The physical design of a robot and the policy that controls its motion are inherently coupled, and should be determined according to the task and environment. In an increasing number of applications, data-driven and learning-based approaches, such as deep reinforcement learning, have proven effective at designing control policies. For most tasks, the only way to evaluate a physical design with respect to such control policies is empirical, i.e., by picking a design and training a control policy for it. Since training these policies is time-consuming, it is computationally infeasible to train separate policies for all possible designs as a means to identify the best one. In this work, we address this limitation by introducing a method that performs simultaneous joint optimization of the physical design and control network. Our approach maintains a distribution over designs and uses…
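The idea of maintaining a distribution over designs while training a single controller can be illustrated with a toy sketch. The abstract is cut off before it describes the update, so everything below is an assumption made for illustration and not the authors' method: the design is a single scalar drawn from a Gaussian with fixed standard deviation, the "policy" is one shared parameter `gain`, `toy_reward` is a hypothetical stand-in for an episode return, and the design mean is updated with a score-function (REINFORCE-style) gradient.

```python
import numpy as np


def toy_reward(design, gain):
    # Hypothetical stand-in for an episode return (not from the paper):
    # highest when design == 2.0 and the controller gain matches the design.
    return -(design - 2.0) ** 2 - (gain - design) ** 2


def joint_optimize(steps=2000, batch=32, sigma=0.3,
                   lr_mu=0.02, lr_gain=0.05, seed=0):
    rng = np.random.default_rng(seed)
    mu = 0.0    # mean of the Gaussian distribution over designs (assumed form)
    gain = 0.0  # single controller parameter shared across all sampled designs
    for _ in range(steps):
        # Sample a batch of candidate designs from the current distribution.
        designs = mu + sigma * rng.standard_normal(batch)
        rewards = toy_reward(designs, gain)
        # Subtract the batch mean as a simple variance-reducing baseline.
        advantages = rewards - rewards.mean()
        # Score-function gradient for the mean:
        # d/dmu log N(d; mu, sigma^2) = (d - mu) / sigma^2
        grad_mu = np.mean(advantages * (designs - mu) / sigma ** 2)
        # Analytic gradient of the toy reward with respect to the gain,
        # averaged over the sampled designs.
        grad_gain = np.mean(-2.0 * (gain - designs))
        # Update the design distribution and the controller in the same step.
        mu += lr_mu * grad_mu
        gain += lr_gain * grad_gain
    return mu, gain


if __name__ == "__main__":
    mu, gain = joint_optimize()
    print(f"design mean: {mu:.3f}, controller gain: {gain:.3f}")
```

In this toy setting the controller is trained on whatever designs are currently being sampled, so the design mean and the controller parameter improve together instead of requiring a fresh controller for each design.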
