Visual processing in context of reinforcement learning
Hlynur Dav\'i{\dh} Hlynsson

TL;DR
This paper explores three novel unsupervised representation learning algorithms to improve data efficiency in deep reinforcement learning by leveraging unlabeled data and auxiliary tasks.
Contribution
It introduces three new algorithms—GRICA, LARP, and RewPred—that utilize different data sources to learn effective state representations for reinforcement learning.
Findings
Unsupervised representation learning accelerates RL training.
Each method has unique strengths and weaknesses.
Including these methods improves learning speed in RL tasks.
Abstract
Although deep reinforcement learning (RL) has recently enjoyed many successes, its methods are still data inefficient, which makes solving numerous problems prohibitively expensive in terms of data. We aim to remedy this by taking advantage of the rich supervisory signal in unlabeled data for learning state representations. This thesis introduces three different representation learning algorithms that have access to different subsets of the data sources that traditional RL algorithms use: (i) GRICA is inspired by independent component analysis (ICA) and trains a deep neural network to output statistically independent features of the input. GrICA does so by minimizing the mutual information between each feature and the other features. Additionally, GrICA only requires an unsorted collection of environment states. (ii) Latent Representation Prediction (LARP) requires more context: in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Data Stream Mining Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
