Learning Predictive Models From Observation and Interaction

Karl Schmeckpeper; Annie Xie; Oleh Rybkin; Stephen Tian; Kostas; Daniilidis; Sergey Levine; Chelsea Finn

arXiv:1912.12773·cs.LG·January 1, 2020

Learning Predictive Models From Observation and Interaction

Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas, Daniilidis, Sergey Levine, Chelsea Finn

PDF

TL;DR

This paper introduces a method for robots to learn predictive models by combining interaction data with observational videos, enabling skill acquisition without direct robotic demonstrations.

Contribution

It formulates a graphical model that treats actions as observed or unobserved variables and uses domain priors to leverage passive observational data for robotic learning.

Findings

01

Robots can learn tool use from human videos without robotic demonstrations.

02

The method effectively integrates observational and interaction data.

03

Robots successfully perform manipulation tasks after training with diverse datasets.

Abstract

Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes. However, learning a model that captures the dynamics of complex skills represents a major challenge: if the agent needs a good model to perform these skills, it might never be able to collect the experience on its own that is required to learn these delicate and complex behaviors. Instead, we can imagine augmenting the training set with observational data of other agents, such as humans. Such data is likely more plentiful, but represents a different embodiment. For example, videos of humans might show a robot how to use a tool, but (i) are not annotated with suitable robot actions, and (ii) contain a systematic distributional shift due to the embodiment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.