Curricula for Learning Robust Policies with Factored State   Representations in Changing Environments

Panayiotis Panayiotou; \"Ozg\"ur \c{S}im\c{s}ek

arXiv:2409.09169·cs.LG·September 20, 2024

Curricula for Learning Robust Policies with Factored State Representations in Changing Environments

Panayiotis Panayiotou, \"Ozg\"ur \c{S}im\c{s}ek

PDF

Open Access

TL;DR

This paper investigates how different curricula in reinforcement learning, utilizing factored state representations, can improve the robustness of policies in dynamic and unpredictable environments, with practical experimental insights.

Contribution

It introduces simple curricula strategies that enhance policy robustness in factored state RL, demonstrating their effectiveness through experiments.

Findings

01

Curricula varying the variable of highest regret improve robustness.

02

Factored representations aid in generalization and sample efficiency.

03

Simple curriculum adjustments significantly enhance policy robustness.

Abstract

Robust policies enable reinforcement learning agents to effectively adapt to and operate in unpredictable, dynamic, and ever-changing real-world environments. Factored representations, which break down complex state and action spaces into distinct components, can improve generalization and sample efficiency in policy learning. In this paper, we explore how the curriculum of an agent using a factored state representation affects the robustness of the learned policy. We experimentally demonstrate three simple curricula, such as varying only the variable of highest regret between episodes, that can significantly enhance policy robustness, offering practical insights for reinforcement learning in complex environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making · Intelligent Tutoring Systems and Adaptive Learning