Guided Cost Learning: Deep Inverse Optimal Control via Policy   Optimization

Chelsea Finn; Sergey Levine; Pieter Abbeel

arXiv:1603.00448·cs.LG·May 30, 2016·369 cites

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization

Chelsea Finn, Sergey Levine, Pieter Abbeel

PDF

Open Access 4 Repos

TL;DR

This paper introduces a deep inverse optimal control method that learns complex, nonlinear cost functions directly from demonstrations, improving sample efficiency and performance in high-dimensional robotic tasks.

Contribution

It presents a novel algorithm for learning arbitrary nonlinear cost functions without feature engineering and an efficient sample-based approach for MaxEnt IOC in high-dimensional systems.

Findings

01

Significant improvement over prior methods in task complexity.

02

Enhanced sample efficiency in robotic manipulation tasks.

03

Effective learning of neural network-based cost functions.

Abstract

Reinforcement learning can acquire complex behaviors from high-level specifications. However, defining a cost function that can be optimized effectively and encodes the correct task is challenging in practice. We explore how inverse optimal control (IOC) can be used to learn behaviors from demonstrations, with applications to torque control of high-dimensional robotic systems. Our method addresses two key challenges in inverse optimal control: first, the need for informative features and effective regularization to impose structure on the cost, and second, the difficulty of learning the cost function under unknown dynamics for high-dimensional continuous systems. To address the former challenge, we present an algorithm capable of learning arbitrary nonlinear cost functions, such as neural networks, without meticulous feature engineering. To address the latter challenge, we formulate an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptical Imaging and Spectroscopy Techniques · Reinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference