Accelerating Distributed Deep Reinforcement Learning by In-Network   Experience Sampling

Masaki Furukawa; Hiroki Matsutani

arXiv:2110.13506·cs.DC·March 13, 2023

Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling

Masaki Furukawa, Hiroki Matsutani

PDF

Open Access

TL;DR

This paper enhances distributed deep reinforcement learning by implementing DPDK-based network optimizations and an in-network experience replay server, significantly reducing communication latencies and improving training efficiency.

Contribution

It introduces an in-network experience replay memory server and DPDK-based optimizations to reduce communication overhead in distributed DQN training.

Findings

01

Kernel bypassing with DPDK reduces latency by up to 58.9%.

02

In-network experience replay server decreases experience access latency by up to 28.1%.

03

Communication latency for prioritized sampling is reduced by up to 29.1%.

Abstract

A computing cluster that interconnects multiple compute nodes is used to accelerate distributed reinforcement learning based on DQN (Deep Q-Network). In distributed reinforcement learning, Actor nodes acquire experiences by interacting with a given environment and a Learner node optimizes their DQN model. Since data transfer between Actor and Learner nodes increases depending on the number of Actor nodes and their experience size, communication overhead between them is one of major performance bottlenecks. In this paper, their communication is accelerated by DPDK-based network optimizations, and DPDK-based low-latency experience replay memory server is deployed between Actor and Learner nodes interconnected with a 40GbE (40Gbit Ethernet) network. Evaluation results show that, as a network optimization technique, kernel bypassing by DPDK reduces network access latencies to a shared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Functional Brain Connectivity Studies · Advanced Memory and Neural Computing

MethodsQ-Learning · Experience Replay · Dense Connections · Convolution · Deep Q-Network