Component Transfer Learning for Deep RL Based on Abstract Representations
Geoffrey van Driessel, Vincent Francois-Lavet

TL;DR
This paper proposes a transfer learning method for deep reinforcement learning that leverages shared low-dimensional abstract representations to transfer knowledge between tasks with similar dynamics but different visual inputs.
Contribution
It introduces a transfer approach based on freezing learned internal dynamics and value functions within a low-dimensional embedding space, highlighting the effects of local minima and base model choice.
Findings
Frozen models can cause local minima affecting transfer quality.
Successful transfer depends heavily on the choice of the base model.
Converged embeddings enable faster transfer compared to learning from scratch.
Abstract
In this work we investigate a specific transfer learning approach for deep reinforcement learning in the context where the internal dynamics between two tasks are the same but the visual representations differ. We learn a low-dimensional encoding of the environment, meant to capture summarizing abstractions, from which the internal dynamics and value functions are learned. Transfer is then obtained by freezing the learned internal dynamics and value functions, thus reusing the shared low-dimensional embedding space. When retraining the encoder for transfer, we make several observations: (i) in some cases, there are local minima that have small losses but a mismatching embedding space, resulting in poor task performance and (ii) in the absence of local minima, the output of the encoder converges in our experiments to the same embedding space, which leads to a fast and efficient transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
