Stronger Generalization Guarantees for Robot Learning by Combining   Generative Models and Real-World Data

Abhinav Agarwal; Sushant Veer; Allen Z. Ren; Anirudha Majumdar

arXiv:2111.08761·cs.RO·July 25, 2022

Stronger Generalization Guarantees for Robot Learning by Combining Generative Models and Real-World Data

Abhinav Agarwal, Sushant Veer, Allen Z. Ren, Anirudha Majumdar

PDF

Open Access

TL;DR

This paper introduces a framework that combines generative models with real-world data to provide stronger guarantees for robotic policy generalization in unseen environments, validated through simulations and hardware experiments.

Contribution

The paper proposes a novel method that uses generative models as priors and updates them with real data to improve generalization guarantees in robot learning.

Findings

01

Outperforms prior methods in simulation tests.

02

Provides validated bounds on generalization in hardware experiments.

03

Effective in complex sensory and dynamic environments.

Abstract

We are motivated by the problem of learning policies for robotic systems with rich sensory inputs (e.g., vision) in a manner that allows us to guarantee generalization to environments unseen during training. We provide a framework for providing such generalization guarantees by leveraging a finite dataset of real-world environments in combination with a (potentially inaccurate) generative model of environments. The key idea behind our approach is to utilize the generative model in order to implicitly specify a prior over policies. This prior is updated using the real-world dataset of environments by minimizing an upper bound on the expected cost across novel environments derived via Probably Approximately Correct (PAC)-Bayes generalization theory. We demonstrate our approach on two simulated systems with nonlinear/hybrid dynamics and rich sensing modalities: (i) quadrotor navigation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Neural Networks and Applications · Reinforcement Learning in Robotics