Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios

Alberto Sinigaglia; Niccol\`o Turcato; Ruggero Carli; Gian Antonio Susto

arXiv:2412.06390·cs.LG·October 14, 2025

Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios

Alberto Sinigaglia, Niccol\`o Turcato, Ruggero Carli, Gian Antonio Susto

PDF

Open Access

TL;DR

This paper introduces EdgeD3, an efficient reinforcement learning algorithm optimized for edge scenarios, reducing computational resources by 30% while maintaining or improving performance compared to existing methods.

Contribution

EdgeD3 is a novel extension of DDPG that significantly reduces GPU time and memory usage, tailored for resource-constrained edge environments.

Findings

01

Achieves 25% less GPU time than standard DDPG.

02

Uses 30% less memory and computational resources.

03

Outperforms or matches state-of-the-art methods on benchmarks.

Abstract

Deep Reinforcement Learning is gaining increasing attention thanks to its capability to learn complex policies in high-dimensional settings. Recent advancements utilize a dual-network architecture to learn optimal policies through the Q-learning algorithm. However, this approach has notable drawbacks, such as an overestimation bias that can disrupt the learning process and degrade the performance of the resulting policy. To address this, novel algorithms have been developed that mitigate overestimation bias by employing multiple Q-functions. Edge scenarios, which prioritize privacy, have recently gained prominence. In these settings, limited computational resources pose a significant challenge for complex Machine Learning approaches, making the efficiency of algorithms crucial for their performance. In this work, we introduce a novel Reinforcement Learning algorithm tailored for edge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Age of Information Optimization · Stochastic processes and financial applications

MethodsSoftmax · Attention Is All You Need · Q-Learning