Safe Multi-Agent Deep Reinforcement Learning for Privacy-Aware Edge-Device Collaborative DNN Inference

Hong Wang; Xuwei Fan; Zhipeng Cheng; Yachao Yuan; Minghui Min; Minghui Liwang; Xiaoyu Xia

arXiv:2603.00129·cs.MA·March 3, 2026

Safe Multi-Agent Deep Reinforcement Learning for Privacy-Aware Edge-Device Collaborative DNN Inference

Hong Wang, Xuwei Fan, Zhipeng Cheng, Yachao Yuan, Minghui Min, Minghui Liwang, Xiaoyu Xia

PDF

Open Access

TL;DR

This paper introduces a hierarchical, reinforcement learning-based framework for privacy-aware, resource-efficient DNN inference on edge devices, optimizing delay, energy, and privacy under dynamic conditions.

Contribution

It proposes a novel Hierarchical Constrained Multi-Agent PPO algorithm with Lagrangian relaxation for adaptive, safe, and efficient model deployment and resource management in edge-device collaborative inference.

Findings

01

HC-MAPPO-L consistently satisfies delay constraints.

02

It achieves a better balance of energy consumption and privacy cost.

03

Outperforms baseline algorithms across various scenarios.

Abstract

As Deep Neural Network (DNN) inference becomes increasingly prevalent on edge and mobile platforms, critical challenges emerge in privacy protection, resource constraints, and dynamic model deployment. This paper proposes a privacy-aware collaborative inference framework, in which adaptive model partitioning is performed across edge devices and servers. To jointly optimize inference delay, energy consumption, and privacy cost under dynamic service demands and resource constraints, we formulate the joint problem as a Constrained Markov Decision Process (CMDP) that integrates model deployment, user-server association, model partitioning, and resource allocation. We propose a Hierarchical Constrained Multi-Agent Proximal Policy Optimization with Lagrangian relaxation (HC-MAPPO-L) algorithm, a safe reinforcement learning-based framework that enhances Multi-Agent Proximal Policy Optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Privacy-Preserving Technologies in Data · Advanced Neural Network Applications