CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization
Derui Wang, Kristen Moore, Diksha Goel, Minjune Kim, Gang Li, Yang Li,, Robin Doss, Minhui Xue, Bo Li, Seyit Camtepe, Liming Zhu

TL;DR
This paper introduces CAMP, a new training paradigm for deep reinforcement learning that maximizes certified robustness radius, leading to significantly improved robustness and return trade-offs without sacrificing provable guarantees.
Contribution
CAMP formulates a novel surrogate loss based on local certified radii and introduces policy imitation to enhance training stability, advancing certifiable robustness in DRL.
Findings
CAMP achieves up to twice the certified expected return compared to baselines.
CAMP improves the robustness-return trade-off across various tasks.
Experimental results validate the effectiveness of CAMP in enhancing robustness.
Abstract
Deep reinforcement learning (DRL) has gained widespread adoption in control and decision-making tasks due to its strong performance in dynamic environments. However, DRL agents are vulnerable to noisy observations and adversarial attacks, and concerns about the adversarial robustness of DRL systems have emerged. Recent efforts have focused on addressing these robustness issues by establishing rigorous theoretical guarantees for the returns achieved by DRL agents in adversarial settings. Among these approaches, policy smoothing has proven to be an effective and scalable method for certifying the robustness of DRL agents. Nevertheless, existing certifiably robust DRL relies on policies trained with simple Gaussian augmentations, resulting in a suboptimal trade-off between certified robustness and certified return. To address this issue, we introduce a novel paradigm dubbed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Reinforcement Learning in Robotics · Robot Manipulation and Learning
