Doubly Inhomogeneous Reinforcement Learning

Liyuan Hu; Mengbing Li; Chengchun Shi; Zhenke Wu; Piotr; Fryzlewicz

arXiv:2211.03983·stat.ML·March 18, 2025·1 cites

Doubly Inhomogeneous Reinforcement Learning

Liyuan Hu, Mengbing Li, Chengchun Shi, Zhenke Wu, Piotr, Fryzlewicz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel reinforcement learning algorithm designed to handle environments with changing dynamics over time and across individuals, improving decision-making in non-stationary, heterogeneous settings.

Contribution

It proposes an original, robust method combining change point detection and clustering to identify stable data segments for better policy learning in doubly inhomogeneous environments.

Findings

01

Method effectively detects change points and clusters in complex environments.

02

Algorithm outperforms existing approaches in simulations.

03

Real data application demonstrates practical utility.

Abstract

This paper studies reinforcement learning (RL) in doubly inhomogeneous environments under temporal non-stationarity and subject heterogeneity. In a number of applications, it is commonplace to encounter datasets generated by system dynamics that may change over time and population, challenging high-quality sequential decision making. Nonetheless, most existing RL solutions require either temporal stationarity or subject homogeneity, which would result in sub-optimal policies if both assumptions were violated. To address both challenges simultaneously, we propose an original algorithm to determine the ``best data chunks" that display similar dynamics over time and across individuals for policy learning, which alternates between most recent change point detection and cluster identification. Our method is general, and works with a wide range of clustering and change point detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zaza0209/dirl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics