OLAF: Programmable Data Plane Acceleration for Asynchronous Distributed Reinforcement Learning

Nehal Baganal Krishna; Anam Tahir; Firas Khamis; Mina Tahmasbi Arashloo; Michael Zink; Amr Rizk

arXiv:2507.05876·cs.NI·July 9, 2025

OLAF: Programmable Data Plane Acceleration for Asynchronous Distributed Reinforcement Learning

Nehal Baganal Krishna, Anam Tahir, Firas Khamis, Mina Tahmasbi Arashloo, Michael Zink, Amr Rizk

PDF

Open Access

TL;DR

This paper presents OLAF, a programmable data plane architecture that accelerates asynchronous distributed reinforcement learning by reducing model update staleness and network congestion through inline processing and novel queueing mechanisms.

Contribution

OLAF introduces a novel in-network processing architecture with a queueing mechanism and feedback control to mitigate staleness in distributed RL training.

Findings

01

Reduces model update staleness significantly.

02

Improves convergence rate in asynchronous DRL.

03

Maintains global fairness and responsiveness.

Abstract

Asynchronous Distributed Reinforcement Learning (DRL) can suffer from degraded convergence when model updates become stale, often the result of network congestion and packet loss during large-scale training. This work introduces a network data-plane acceleration architecture that mitigates such staleness by enabling inline processing of DRL model updates as they traverse the accelerator engine. To this end, we design and prototype a novel queueing mechanism that opportunistically combines compatible updates sharing a network element, reducing redundant traffic and preserving update utility. Complementing this we provide a lightweight transmission control mechanism at the worker nodes that is guided by feedback from the in-network accelerator. To assess model utility at line rate, we introduce the Age-of-Model (AoM) metric as a proxy for staleness and verify global fairness and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Software-Defined Networks and 5G