A Reinforcement Learning-Based Approach to Graph Discovery in   D2D-Enabled Federated Learning

Satyavrat Wagle; Anindya Bijoy Das; David J. Love; Christopher G.; Brinton

arXiv:2308.03933·eess.SP·August 9, 2023·GLOBECOM

A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning

Satyavrat Wagle, Anindya Bijoy Das, David J. Love, Christopher G., Brinton

PDF

Open Access

TL;DR

This paper introduces a decentralized reinforcement learning approach for discovering D2D communication graphs in federated learning, enhancing convergence and resilience while addressing privacy and trust issues.

Contribution

It proposes a novel RL-based decentralized method for D2D graph discovery that balances privacy, trust, and resource efficiency in federated learning.

Findings

01

Faster convergence in federated learning with D2D communication.

02

Improved resilience to stragglers in FL.

03

Effective graph discovery balancing privacy and trust.

Abstract

Augmenting federated learning (FL) with direct device-to-device (D2D) communications can help improve convergence speed and reduce model bias through rapid local information exchange. However, data privacy concerns, device trust issues, and unreliable wireless channels each pose challenges to determining an effective yet resource efficient D2D structure. In this paper, we develop a decentralized reinforcement learning (RL) methodology for D2D graph discovery that promotes communication of non-sensitive yet impactful data-points over trusted yet reliable links. Each device functions as an RL agent, training a policy to predict the impact of incoming links. Local (device-level) and global rewards are coupled through message passing within and between device clusters. Numerical experiments confirm the advantages offered by our method in terms of convergence speed and straggler resilience…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings