A Behavior-Aware Approach for Deep Reinforcement Learning in   Non-stationary Environments without Known Change Points

Zihe Liu; Jie Lu; Guangquan Zhang; Junyu Xuan

arXiv:2405.14214·cs.LG·May 24, 2024

A Behavior-Aware Approach for Deep Reinforcement Learning in Non-stationary Environments without Known Change Points

Zihe Liu, Jie Lu, Guangquan Zhang, Junyu Xuan

PDF

Open Access

TL;DR

This paper presents BADA, a novel framework that detects environmental changes and adapts policies in non-stationary settings by analyzing behavioral variations with Wasserstein distances, improving reinforcement learning robustness.

Contribution

Introducing Behavior-Aware Detection and Adaptation (BADA), a new method that detects environment changes through behavior analysis and adapts policies without manual thresholds.

Findings

01

BADA outperforms existing algorithms in non-stationary environments.

02

Behavior analysis effectively detects environmental changes.

03

Adaptive policies maintain higher performance over time.

Abstract

Deep reinforcement learning is used in various domains, but usually under the assumption that the environment has stationary conditions like transitions and state distributions. When this assumption is not met, performance suffers. For this reason, tracking continuous environmental changes and adapting to unpredictable conditions is challenging yet crucial because it ensures that systems remain reliable and flexible in practical scenarios. Our research introduces Behavior-Aware Detection and Adaptation (BADA), an innovative framework that merges environmental change detection with behavior adaptation. The key inspiration behind our method is that policies exhibit different global behaviors in changing environments. Specifically, environmental changes are identified by analyzing variations between behaviors using Wasserstein distances without manually set thresholds. The model adapts to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making · Evolutionary Algorithms and Applications · Innovation Diffusion and Forecasting

MethodsSparse Evolutionary Training