Energy-Based Continuous Inverse Optimal Control

Yifei Xu; Jianwen Xie; Tianyang Zhao; Chris Baker; Yibiao Zhao; and; Ying Nian Wu

arXiv:1904.05453·cs.LG·April 20, 2022·1 cites

Energy-Based Continuous Inverse Optimal Control

Yifei Xu, Jianwen Xie, Tianyang Zhao, Chris Baker, Yibiao Zhao, and, Ying Nian Wu

PDF

Open Access

TL;DR

This paper introduces an energy-based framework for continuous inverse optimal control, learning cost functions from expert trajectories using maximum likelihood, Langevin dynamics, and cooperative training with a trajectory generator, demonstrated on autonomous driving.

Contribution

It proposes a novel energy-based model approach for inverse optimal control, combining sampling and optimization, with a cooperative training scheme for efficiency, applied to autonomous driving tasks.

Findings

01

Effective learning of cost functions from expert data.

02

Improved efficiency through cooperative training with a trajectory generator.

03

Successful application to autonomous driving scenarios.

Abstract

The problem of continuous inverse optimal control (over finite time horizon) is to learn the unknown cost function over the sequence of continuous control variables from expert demonstrations. In this article, we study this fundamental problem in the framework of energy-based model, where the observed expert trajectories are assumed to be random samples from a probability density function defined as the exponential of the negative cost function up to a normalizing constant. The parameters of the cost function are learned by maximum likelihood via an "analysis by synthesis" scheme, which iterates (1) synthesis step: sample the synthesized trajectories from the current probability density using the Langevin dynamics via back-propagation through time, and (2) analysis step: update the model parameters based on the statistical difference between the synthesized trajectories and the observed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Machine Learning in Materials Science · Machine Learning and ELM