Visual processing in context of reinforcement learning

Hlynur Dav\'i{\dh} Hlynsson

arXiv:2208.12525·cs.LG·August 29, 2022

Visual processing in context of reinforcement learning

Hlynur Dav\'i{\dh} Hlynsson

PDF

Open Access

TL;DR

This paper explores three novel unsupervised representation learning algorithms to improve data efficiency in deep reinforcement learning by leveraging unlabeled data and auxiliary tasks.

Contribution

It introduces three new algorithms—GRICA, LARP, and RewPred—that utilize different data sources to learn effective state representations for reinforcement learning.

Findings

01

Unsupervised representation learning accelerates RL training.

02

Each method has unique strengths and weaknesses.

03

Including these methods improves learning speed in RL tasks.

Abstract

Although deep reinforcement learning (RL) has recently enjoyed many successes, its methods are still data inefficient, which makes solving numerous problems prohibitively expensive in terms of data. We aim to remedy this by taking advantage of the rich supervisory signal in unlabeled data for learning state representations. This thesis introduces three different representation learning algorithms that have access to different subsets of the data sources that traditional RL algorithms use: (i) GRICA is inspired by independent component analysis (ICA) and trains a deep neural network to output statistically independent features of the input. GrICA does so by minimizing the mutual information between each feature and the other features. Additionally, GrICA only requires an unsorted collection of environment states. (ii) Latent Representation Prediction (LARP) requires more context: in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Data Stream Mining Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings