Assessing the Impact of Distribution Shift on Reinforcement Learning   Performance

Ted Fujimoto; Joshua Suetterlein; Samrat Chatterjee; Auroop; Ganguly

arXiv:2402.03590·cs.LG·February 7, 2024·1 cites

Assessing the Impact of Distribution Shift on Reinforcement Learning Performance

Ted Fujimoto, Joshua Suetterlein, Samrat Chatterjee, Auroop, Ganguly

PDF

Open Access

TL;DR

This paper introduces evaluation methods to measure the robustness of reinforcement learning algorithms under distribution shifts, emphasizing the importance of time series analysis and causal impact measurement for more reliable assessments.

Contribution

It proposes new evaluation tools for RL robustness under distribution shifts, incorporating time series analysis and causal impact measurement, addressing a gap in current reliability metrics.

Findings

01

Distribution shifts significantly affect RL performance.

02

Time series analysis provides better robustness insights.

03

Causal impact measurement helps understand environment effects.

Abstract

Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful convergence to the optimal policy during training, may obfuscate overfitting or dependence on the experimental setup. Although researchers in RL have proposed reliability metrics that account for uncertainty to better understand each algorithm's strengths and weaknesses, the recommendations of past work do not assume the presence of out-of-distribution observations. We propose a set of evaluation methods that measure the robustness of RL algorithms under distribution shifts. The tools presented here argue for the need to account for performance over time while the agent is acting in its environment. In particular, we recommend time series analysis as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSupply Chain and Inventory Management · Evolutionary Algorithms and Applications

MethodsSparse Evolutionary Training