Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL   Agents

Chung-En Sun; Sicun Gao; Tsui-Wei Weng

arXiv:2406.18062·cs.LG·June 27, 2024

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Chung-En Sun, Sicun Gao, Tsui-Wei Weng

PDF

Open Access 1 Repo

TL;DR

This paper introduces S-DQN and S-PPO algorithms that significantly improve robustness and clean rewards in smoothed deep reinforcement learning agents, outperforming existing methods and attacks across benchmarks.

Contribution

The study presents novel algorithms S-DQN and S-PPO that enhance robustness and rewards in smoothed DRL, filling a performance gap in current approaches.

Findings

01

S-DQN and S-PPO outperform existing smoothed agents by over 2x under strong attacks.

02

The new algorithms achieve higher clean rewards and robustness guarantees.

03

Smoothed Attack is nearly twice as effective in reducing rewards of smoothed agents.

Abstract

Robustness remains a paramount concern in deep reinforcement learning (DRL), with randomized smoothing emerging as a key technique for enhancing this attribute. However, a notable gap exists in the performance of current smoothed DRL agents, often characterized by significantly low clean rewards and weak robustness. In response to this challenge, our study introduces innovative algorithms aimed at training effective smoothed robust DRL agents. We propose S-DQN and S-PPO, novel approaches that demonstrate remarkable improvements in clean rewards, empirical robustness, and robustness guarantee across standard RL benchmarks. Notably, our S-DQN and S-PPO agents not only significantly outperform existing smoothed agents by an average factor of $2.16 \times$ under the strongest attack, but also surpass previous robustly-trained agents by an average factor of $2.13 \times$ . This represents a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trustworthy-ml-lab/robust_highutil_smoothed_drl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Agent-Based Network Management · Formal Methods in Verification · Digital Rights Management and Security

MethodsRandomized Smoothing