A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning
Satyavrat Wagle, Anindya Bijoy Das, David J. Love, Christopher G., Brinton

TL;DR
This paper introduces a decentralized reinforcement learning approach for discovering D2D communication graphs in federated learning, enhancing convergence and resilience while addressing privacy and trust issues.
Contribution
It proposes a novel RL-based decentralized method for D2D graph discovery that balances privacy, trust, and resource efficiency in federated learning.
Findings
Faster convergence in federated learning with D2D communication.
Improved resilience to stragglers in FL.
Effective graph discovery balancing privacy and trust.
Abstract
Augmenting federated learning (FL) with direct device-to-device (D2D) communications can help improve convergence speed and reduce model bias through rapid local information exchange. However, data privacy concerns, device trust issues, and unreliable wireless channels each pose challenges to determining an effective yet resource efficient D2D structure. In this paper, we develop a decentralized reinforcement learning (RL) methodology for D2D graph discovery that promotes communication of non-sensitive yet impactful data-points over trusted yet reliable links. Each device functions as an RL agent, training a policy to predict the impact of incoming links. Local (device-level) and global rewards are coupled through message passing within and between device clusters. Numerical experiments confirm the advantages offered by our method in terms of convergence speed and straggler resilience…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
