Generalizing Skills with Semi-Supervised Reinforcement Learning

Chelsea Finn; Tianhe Yu; Justin Fu; Pieter Abbeel; Sergey Levine

arXiv:1612.00429·cs.LG·March 13, 2017·34 cites

Generalizing Skills with Semi-Supervised Reinforcement Learning

Chelsea Finn, Tianhe Yu, Justin Fu, Pieter Abbeel, Sergey Levine

PDF

Open Access

TL;DR

This paper introduces a semi-supervised reinforcement learning framework that enables agents to generalize skills across diverse environments by leveraging limited labeled data and unlabeled experience, improving policy robustness from image-based inputs.

Contribution

The authors formalize semi-supervised RL and propose an inverse RL-inspired algorithm to infer rewards in unlabeled environments, enhancing generalization of deep policies.

Findings

01

Improved policy generalization in image-based control tasks.

02

Outperforms supervised reward learning methods.

03

Effective use of limited labeled data for broad environment adaptation.

Abstract

Deep reinforcement learning (RL) can acquire complex behaviors from low-level inputs, such as images. However, real-world applications of such methods require generalizing to the vast variability of the real world. Deep networks are known to achieve remarkable generalization when provided with massive amounts of labeled data, but can we provide this breadth of experience to an RL agent, such as a robot? The robot might continuously learn as it explores the world around it, even while deployed. However, this learning requires access to a reward function, which is often hard to measure in real-world domains, where the reward could depend on, for example, unknown positions of objects or the emotional state of the user. Conversely, it is often quite practical to provide the agent with reward functions in a limited set of situations, such as when a human supervisor is present or in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Robot Manipulation and Learning