Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu, Mengbing Li, Chengchun Shi, Zhenke Wu, Piotr, Fryzlewicz

TL;DR
This paper introduces a novel reinforcement learning algorithm designed to handle environments with changing dynamics over time and across individuals, improving decision-making in non-stationary, heterogeneous settings.
Contribution
It proposes an original, robust method combining change point detection and clustering to identify stable data segments for better policy learning in doubly inhomogeneous environments.
Findings
Method effectively detects change points and clusters in complex environments.
Algorithm outperforms existing approaches in simulations.
Real data application demonstrates practical utility.
Abstract
This paper studies reinforcement learning (RL) in doubly inhomogeneous environments under temporal non-stationarity and subject heterogeneity. In a number of applications, it is commonplace to encounter datasets generated by system dynamics that may change over time and population, challenging high-quality sequential decision making. Nonetheless, most existing RL solutions require either temporal stationarity or subject homogeneity, which would result in sub-optimal policies if both assumptions were violated. To address both challenges simultaneously, we propose an original algorithm to determine the ``best data chunks" that display similar dynamics over time and across individuals for policy learning, which alternates between most recent change point detection and cluster identification. Our method is general, and works with a wide range of clustering and change point detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
