LVIS: Learning from Value Function Intervals for Contact-Aware Robot   Controllers

Robin Deits; Twan Koolen; Russ Tedrake

arXiv:1809.05802·cs.RO·September 18, 2018

LVIS: Learning from Value Function Intervals for Contact-Aware Robot Controllers

Robin Deits, Twan Koolen, Russ Tedrake

PDF

1 Repo

TL;DR

LVIS introduces a novel approach for contact-aware robot control by learning value function intervals through partial mixed-integer optimization, enabling effective policy training and control in contact-rich environments.

Contribution

It proposes a new method that combines global mixed-integer optimization with value function learning to address local minima and non-uniqueness in contact-rich control tasks.

Findings

01

Successfully applied to cart-pole with walls

02

Effective on planar humanoid robot model

03

Demonstrates control through contact

Abstract

Guided policy search is a popular approach for training controllers for high-dimensional systems, but it has a number of pitfalls. Non-convex trajectory optimization has local minima, and non-uniqueness in the optimal policy itself can mean that independently-optimized samples do not describe a coherent policy from which to train. We introduce LVIS, which circumvents the issue of local minima through global mixed-integer optimization and the issue of non-uniqueness through learning the optimal value function (or cost-to-go) rather than the optimal policy. To avoid the expense of solving the mixed-integer programs to full global optimality, we instead solve them only partially, extracting intervals containing the true cost-to-go from early termination of the branch-and-bound algorithm. These interval samples are used to weakly supervise the training of a neural net which approximates the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rdeits/LVIS-dev
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.