# Decentralized Computation Offloading for Multi-User Mobile Edge   Computing: A Deep Reinforcement Learning Approach

**Authors:** Zhao Chen, Xiaodong Wang

arXiv: 1812.07394 · 2020-10-20

## TL;DR

This paper proposes a deep reinforcement learning approach using DDPG for decentralized computation offloading in multi-user MEC systems, effectively reducing power consumption and delay through adaptive power allocation.

## Contribution

It introduces a novel DDPG-based decentralized offloading strategy that independently learns efficient policies at each user, outperforming existing methods.

## Key findings

- DDPG-based strategy reduces computation cost compared to DQN and greedy methods.
- Efficient policies are learned independently at each user.
- Power-delay tradeoff is analyzed for the proposed strategies.

## Abstract

Mobile edge computing (MEC) emerges recently as a promising solution to relieve resource-limited mobile devices from computation-intensive tasks, which enables devices to offload workloads to nearby MEC servers and improve the quality of computation experience. Nevertheless, by considering a MEC system consisting of multiple mobile users with stochastic task arrivals and wireless channels in this paper, the design of computation offloading policies is challenging to minimize the long-term average computation cost in terms of power consumption and buffering delay. A deep reinforcement learning (DRL) based decentralized dynamic computation offloading strategy is investigated to build a scalable MEC system with limited feedback. Specifically, a continuous action space-based DRL approach named deep deterministic policy gradient (DDPG) is adopted to learn efficient computation offloading policies independently at each mobile user. Thus, powers of both local execution and task offloading can be adaptively allocated by the learned policies from each user's local observation of the MEC system. Numerical results are illustrated to demonstrate that efficient policies can be learned at each user, and performance of the proposed DDPG based decentralized strategy outperforms the conventional deep Q-network (DQN) based discrete power control strategy and some other greedy strategies with reduced computation cost. Besides, the power-delay tradeoff is also analyzed for both the DDPG based and DQN based strategies.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.07394/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1812.07394/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/1812.07394/full.md

---
Source: https://tomesphere.com/paper/1812.07394