Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young, Nicolas Pugeault

TL;DR
This paper investigates the chaotic behavior of deep reinforcement learning policies under small perturbations and proposes a Lyapunov exponent regularization method to improve their robustness for real-world applications.
Contribution
The paper introduces a Lyapunov exponent regularization technique to reduce chaos in deep RL policies, enhancing their robustness against noise and adversarial attacks.
Findings
Chaotic behavior in RL policies can cause significant performance issues.
Lyapunov regularization reduces chaos and improves policy resilience.
Enhanced robustness makes deep RL more applicable to real-world scenarios.
Abstract
Deep reinforcement learning agents achieve state-of-the-art performance in a wide range of simulated control tasks. However, successful applications to real-world problems remain limited. One reason for this dichotomy is because the learnt policies are not robust to observation noise or adversarial attacks. In this paper, we investigate the robustness of deep RL policies to a single small state perturbation in deterministic continuous control tasks. We demonstrate that RL policies can be deterministically chaotic, as small perturbations to the system state have a large impact on subsequent state and reward trajectories. This unstable non-linear behaviour has two consequences: first, inaccuracies in sensor readings, or adversarial attacks, can cause significant performance degradation; second, even policies that show robust performance in terms of rewards may have unpredictable behaviour…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · stochastic dynamics and bifurcation · Evolutionary Algorithms and Applications
