Exploiting generalization in the subspaces for faster model-based learning
Maryam Hashemzadeh, Reshad Hosseini, Majid Nili Ahmadabadi

TL;DR
This paper presents MoBLeS, a model-based reinforcement learning method that leverages subspace generalization to accelerate early learning by balancing experience sharing and perceptual aliasing effects.
Contribution
Introduces MoBLeS, a novel approach that uses confidence intervals in subspaces to improve early learning speed in discrete state-space RL.
Findings
MoBLeS accelerates early learning in experiments.
Theoretical convergence to the optimal policy is established.
Balances generalization benefits with perceptual aliasing risks.
Abstract
Due to the lack of enough generalization in the state-space, common methods in Reinforcement Learning (RL) suffer from slow learning speed especially in the early learning trials. This paper introduces a model-based method in discrete state-spaces for increasing learning speed in terms of required experience (but not required computational time) by exploiting generalization in the experiences of the subspaces. A subspace is formed by choosing a subset of features in the original state representation (full-space). Generalization and faster learning in a subspace are due to many-to-one mapping of experiences from the full-space to each state in the subspace. Nevertheless, due to inherent perceptual aliasing in the subspaces, the policy suggested by each subspace does not generally converge to the optimal policy. Our approach, called Model Based Learning with Subspaces (MoBLeS), calculates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Algorithms
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
