Tuning the Weights: The Impact of Initial Matrix Configurations on Successor Features Learning Efficacy
Hyunsu Lee

TL;DR
This study examines how different initializations of the weight matrix in Successor Features affect learning efficiency in RL agents, finding that random initializations lead to faster convergence and more efficient learning.
Contribution
It introduces a systematic comparison of initialization strategies for Successor Features in RL, highlighting the benefits of random matrices for faster learning.
Findings
Random matrix initialization accelerates convergence to optimal SR place fields.
Randomly initialized agents show quicker reduction in value error.
Faster decrease in step length in larger environments with random initialization.
Abstract
The focus of this study is to investigate the impact of different initialization strategies for the weight matrix of Successor Features (SF) on learning efficiency and convergence in Reinforcement Learning (RL) agents. Using a grid-world paradigm, we compare the performance of RL agents, whose SF weight matrix is initialized with either an identity matrix, zero matrix, or a randomly generated matrix (using Xavier, He, or uniform distribution method). Our analysis revolves around evaluating metrics such as value error, step length, PCA of Successor Representation (SR) place field, and the distance of SR matrices between different agents. The results demonstrate that RL agents initialized with random matrices reach the optimal SR place field faster and showcase a quicker reduction in value error, pointing to more efficient learning. Furthermore, these random agents also exhibit a faster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Innovation Diffusion and Forecasting · Evolutionary Algorithms and Applications
