On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Anton Dereventsov; Ranga Raju Vatsavai; Clayton Webster

arXiv:2112.13141·cs.LG·December 28, 2021

TL;DR

This paper demonstrates that simple state space clustering using k-means significantly accelerates reinforcement learning in personalization tasks without compromising performance.

Contribution

It introduces a straightforward clustering-based RL method that improves learning speed in complex personalization environments.

Findings

01 Clustering accelerates RL training.

02 The method maintains high performance levels.

03 Simple algorithms suffice for effective personalization RL.

Abstract

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means algorithm as well as conventional choices of the network architectures and optimization algorithms. Numerical examples demonstrate the efficiency of different RL procedures and are used to illustrate that this technique accelerates the agent's ability to learn and does not restrict the agent's performance.
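The idea of clustering the state space before learning can be illustrated with a small sketch. This is not the authors' implementation: the abstract mentions conventional network architectures, whereas the sketch below swaps in a tabular epsilon-greedy value estimate over (cluster, action) pairs to keep the example short. The toy environment (two groups of 2-D user feature vectors, two actions, a 0/1 reward) and all names such as `kmeans`, `assign`, `train`, and `reward_fn` are hypothetical and chosen only for illustration.

```python
import random


def dist2(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(point, centroids):
    """Index of the centroid nearest to `point`."""
    return min(range(len(centroids)), key=lambda i: dist2(point, centroids[i]))


def kmeans(points, k, iters=20, seed=0):
    """Lloyd's k-means with farthest-first initialization; returns k centroids."""
    rng = random.Random(seed)
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        # Next seed: the point farthest from all centroids chosen so far.
        centroids.append(max(points, key=lambda p: min(dist2(p, c) for c in centroids)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[assign(p, centroids)].append(p)
        # Move each centroid to the mean of its group; keep it if the group is empty.
        centroids = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids


def train(states, reward_fn, centroids, n_actions, steps=2000, eps=0.1, seed=0):
    """Epsilon-greedy value estimates indexed by cluster instead of raw state."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in centroids]
    counts = [[0] * n_actions for _ in centroids]
    for _ in range(steps):
        s = rng.choice(states)
        c = assign(s, centroids)  # the agent only sees the cluster index
        if rng.random() < eps:
            a = rng.randrange(n_actions)
        else:
            a = max(range(n_actions), key=lambda j: q[c][j])
        r = reward_fn(s, a)
        counts[c][a] += 1
        q[c][a] += (r - q[c][a]) / counts[c][a]  # running mean of observed rewards
    return q


# Hypothetical toy data: two well-separated groups of "users" in feature space.
data_rng = random.Random(1)
states = [(data_rng.gauss(0, 0.3), data_rng.gauss(0, 0.3)) for _ in range(100)]
states += [(data_rng.gauss(5, 0.3), data_rng.gauss(5, 0.3)) for _ in range(100)]


def reward_fn(state, action):
    """Assumed reward: action 0 suits the first group, action 1 the second."""
    best = 0 if state[0] < 2.5 else 1
    return 1.0 if action == best else 0.0


centroids = kmeans(states, k=2)
q = train(states, reward_fn, centroids, n_actions=2)
policy = [max(range(2), key=lambda j: q[c][j]) for c in range(len(centroids))]
print(policy)
```

Because the agent learns one value per cluster rather than per state, every visit to any state in a cluster updates the estimate shared by all of them, which is one intuition for why clustering could speed up learning in this kind of toy setting.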

Taxonomy

Topics: Advanced Control Systems Optimization · Control Systems and Identification · Fault Detection and Control Systems