Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios
Alberto Sinigaglia, Niccol\`o Turcato, Ruggero Carli, Gian Antonio Susto

TL;DR
This paper introduces EdgeD3, an efficient reinforcement learning algorithm optimized for edge scenarios, reducing computational resources by 30% while maintaining or improving performance compared to existing methods.
Contribution
EdgeD3 is a novel extension of DDPG that significantly reduces GPU time and memory usage, tailored for resource-constrained edge environments.
Findings
Achieves 25% less GPU time than standard DDPG.
Uses 30% less memory and computational resources.
Outperforms or matches state-of-the-art methods on benchmarks.
Abstract
Deep Reinforcement Learning is gaining increasing attention thanks to its capability to learn complex policies in high-dimensional settings. Recent advancements utilize a dual-network architecture to learn optimal policies through the Q-learning algorithm. However, this approach has notable drawbacks, such as an overestimation bias that can disrupt the learning process and degrade the performance of the resulting policy. To address this, novel algorithms have been developed that mitigate overestimation bias by employing multiple Q-functions. Edge scenarios, which prioritize privacy, have recently gained prominence. In these settings, limited computational resources pose a significant challenge for complex Machine Learning approaches, making the efficiency of algorithms crucial for their performance. In this work, we introduce a novel Reinforcement Learning algorithm tailored for edge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Age of Information Optimization · Stochastic processes and financial applications
MethodsSoftmax · Attention Is All You Need · Q-Learning
