Curricula for Learning Robust Policies with Factored State Representations in Changing Environments
Panayiotis Panayiotou, \"Ozg\"ur \c{S}im\c{s}ek

TL;DR
This paper investigates how different curricula in reinforcement learning, utilizing factored state representations, can improve the robustness of policies in dynamic and unpredictable environments, with practical experimental insights.
Contribution
It introduces simple curricula strategies that enhance policy robustness in factored state RL, demonstrating their effectiveness through experiments.
Findings
Curricula varying the variable of highest regret improve robustness.
Factored representations aid in generalization and sample efficiency.
Simple curriculum adjustments significantly enhance policy robustness.
Abstract
Robust policies enable reinforcement learning agents to effectively adapt to and operate in unpredictable, dynamic, and ever-changing real-world environments. Factored representations, which break down complex state and action spaces into distinct components, can improve generalization and sample efficiency in policy learning. In this paper, we explore how the curriculum of an agent using a factored state representation affects the robustness of the learned policy. We experimentally demonstrate three simple curricula, such as varying only the variable of highest regret between episodes, that can significantly enhance policy robustness, offering practical insights for reinforcement learning in complex environments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making · Intelligent Tutoring Systems and Adaptive Learning
