On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster

TL;DR
This paper demonstrates that simple state space clustering using k-means significantly accelerates reinforcement learning in personalization tasks without compromising performance.
Contribution
It introduces a straightforward clustering-based RL method that improves learning speed in complex personalization environments.
Findings
Clustering accelerates RL training.
The method maintains high performance levels.
Simple algorithms suffice for effective personalization RL.
Abstract
In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic -means algorithm as well as conventional choices of the network architectures and optimization algorithms. Numerical examples demonstrate the efficiency of different RL procedures and are used to illustrate that this technique accelerates the agent's ability to learn and does not restrict the agent's performance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Fault Detection and Control Systems
