Exploiting generalization in the subspaces for faster model-based   learning

Maryam Hashemzadeh; Reshad Hosseini; Majid Nili Ahmadabadi

arXiv:1710.08012·stat.ML·October 30, 2017·1 cites

Exploiting generalization in the subspaces for faster model-based learning

Maryam Hashemzadeh, Reshad Hosseini, Majid Nili Ahmadabadi

PDF

Open Access

TL;DR

This paper presents MoBLeS, a model-based reinforcement learning method that leverages subspace generalization to accelerate early learning by balancing experience sharing and perceptual aliasing effects.

Contribution

Introduces MoBLeS, a novel approach that uses confidence intervals in subspaces to improve early learning speed in discrete state-space RL.

Findings

01

MoBLeS accelerates early learning in experiments.

02

Theoretical convergence to the optimal policy is established.

03

Balances generalization benefits with perceptual aliasing risks.

Abstract

Due to the lack of enough generalization in the state-space, common methods in Reinforcement Learning (RL) suffer from slow learning speed especially in the early learning trials. This paper introduces a model-based method in discrete state-spaces for increasing learning speed in terms of required experience (but not required computational time) by exploiting generalization in the experiences of the subspaces. A subspace is formed by choosing a subset of features in the original state representation (full-space). Generalization and faster learning in a subspace are due to many-to-one mapping of experiences from the full-space to each state in the subspace. Nevertheless, due to inherent perceptual aliasing in the subspaces, the policy suggested by each subspace does not generally converge to the optimal policy. Our approach, called Model Based Learning with Subspaces (MoBLeS), calculates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Algorithms

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings