Self-supervised Learning of Image Embedding for Continuous Control
Carlos Florensa, Jonas Degrave, Nicolas Heess, Jost Tobias, Springenberg, Martin Riedmiller

TL;DR
This paper proposes a self-supervised approach for learning general image embeddings and control primitives for robotic tasks, improving transferability and performance without relying on task-specific rewards.
Contribution
It introduces a novel self-supervised learning method for image embeddings and a new state-action value function structure connecting model-free and model-based methods.
Findings
Effective in three simulated robotic tasks
Improves transferability of learned representations
Enhances learning performance with new value function structure
Abstract
Operating directly from raw high dimensional sensory inputs like images is still a challenge for robotic control. Recently, Reinforcement Learning methods have been proposed to solve specific tasks end-to-end, from pixels to torques. However, these approaches assume the access to a specified reward which may require specialized instrumentation of the environment. Furthermore, the obtained policy and representations tend to be task specific and may not transfer well. In this work we investigate completely self-supervised learning of a general image embedding and control primitives, based on finding the shortest time to reach any state. We also introduce a new structure for the state-action value function that builds a connection between model-free and model-based methods, and improves the performance of the learning algorithm. We experimentally demonstrate these findings in three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control
