Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn, Tianhe Yu, Justin Fu, Pieter Abbeel, Sergey Levine

TL;DR
This paper introduces a semi-supervised reinforcement learning framework that enables agents to generalize skills across diverse environments by leveraging limited labeled data and unlabeled experience, improving policy robustness from image-based inputs.
Contribution
The authors formalize semi-supervised RL and propose an inverse RL-inspired algorithm to infer rewards in unlabeled environments, enhancing generalization of deep policies.
Findings
Improved policy generalization in image-based control tasks.
Outperforms supervised reward learning methods.
Effective use of limited labeled data for broad environment adaptation.
Abstract
Deep reinforcement learning (RL) can acquire complex behaviors from low-level inputs, such as images. However, real-world applications of such methods require generalizing to the vast variability of the real world. Deep networks are known to achieve remarkable generalization when provided with massive amounts of labeled data, but can we provide this breadth of experience to an RL agent, such as a robot? The robot might continuously learn as it explores the world around it, even while deployed. However, this learning requires access to a reward function, which is often hard to measure in real-world domains, where the reward could depend on, for example, unknown positions of objects or the emotional state of the user. Conversely, it is often quite practical to provide the agent with reward functions in a limited set of situations, such as when a human supervisor is present or in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Robot Manipulation and Learning
