Goal-Driven Dynamics Learning via Bayesian Optimization

Somil Bansal; Roberto Calandra; Ted Xiao; Sergey Levine; Claire J.; Tomlin

arXiv:1703.09260·cs.SY·September 25, 2017·34 cites

Goal-Driven Dynamics Learning via Bayesian Optimization

Somil Bansal, Roberto Calandra, Ted Xiao, Sergey Levine, Claire J., Tomlin

PDF

Open Access

TL;DR

This paper introduces a task-specific, Bayesian optimization-based method for learning local linear dynamics models to improve control performance in complex robots, demonstrated through simulations and real quadrotor experiments.

Contribution

It proposes a novel active learning framework that directly optimizes dynamics models for control performance rather than true dynamics, using Bayesian optimization.

Findings

01

Effective in improving control performance on a quadrotor

02

Successful in both simulation and real-world experiments

03

Demonstrates rapid convergence to desired performance

Abstract

Real-world robots are becoming increasingly complex and commonly act in poorly understood environments where it is extremely challenging to model or learn their true dynamics. Therefore, it might be desirable to take a task-specific approach, wherein the focus is on explicitly learning the dynamics model which achieves the best control performance for the task at hand, rather than learning the true dynamics. In this work, we use Bayesian optimization in an active learning framework where a locally linear dynamics model is learned with the intent of maximizing the control performance, and used in conjunction with optimal control schemes to efficiently design a controller for a given task. This model is updated directly based on the performance observed in experiments on the physical system in an iterative manner until a desired performance is achieved. We demonstrate the efficacy of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Machine Learning and Algorithms