Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Chung-En Sun, Sicun Gao, Tsui-Wei Weng

TL;DR
This paper introduces S-DQN and S-PPO algorithms that significantly improve robustness and clean rewards in smoothed deep reinforcement learning agents, outperforming existing methods and attacks across benchmarks.
Contribution
The study presents novel algorithms S-DQN and S-PPO that enhance robustness and rewards in smoothed DRL, filling a performance gap in current approaches.
Findings
S-DQN and S-PPO outperform existing smoothed agents by over 2x under strong attacks.
The new algorithms achieve higher clean rewards and robustness guarantees.
Smoothed Attack is nearly twice as effective in reducing rewards of smoothed agents.
Abstract
Robustness remains a paramount concern in deep reinforcement learning (DRL), with randomized smoothing emerging as a key technique for enhancing this attribute. However, a notable gap exists in the performance of current smoothed DRL agents, often characterized by significantly low clean rewards and weak robustness. In response to this challenge, our study introduces innovative algorithms aimed at training effective smoothed robust DRL agents. We propose S-DQN and S-PPO, novel approaches that demonstrate remarkable improvements in clean rewards, empirical robustness, and robustness guarantee across standard RL benchmarks. Notably, our S-DQN and S-PPO agents not only significantly outperform existing smoothed agents by an average factor of under the strongest attack, but also surpass previous robustly-trained agents by an average factor of . This represents a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Agent-Based Network Management · Formal Methods in Verification · Digital Rights Management and Security
MethodsRandomized Smoothing
