L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated   Self-Supervised Visual Representation Learning

Yasar Abbas Ur Rehman; Yan Gao; Pedro Porto Buarque de Gusm\~ao; Mina; Alibeigi; Jiajun Shen; Nicholas D. Lane

arXiv:2307.07393·cs.CV·July 17, 2023·6 cites

L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning

Yasar Abbas Ur Rehman, Yan Gao, Pedro Porto Buarque de Gusm\~ao, Mina, Alibeigi, Jiajun Shen, Nicholas D. Lane

PDF

Open Access 1 Repo

TL;DR

This paper introduces L-DAWA, a novel federated learning aggregation method that mitigates client bias and divergence in self-supervised visual representation learning, leading to state-of-the-art results on multiple datasets.

Contribution

The paper proposes a layer-wise divergence-aware weight aggregation strategy for federated SSL, addressing client divergence issues during model aggregation.

Findings

01

Achieves new state-of-the-art performance on CIFAR-10/100 and Tiny ImageNet.

02

Effectively reduces client bias and divergence during federated aggregation.

03

Improves robustness and quality of visual representations in federated SSL.

Abstract

The ubiquity of camera-enabled devices has led to large amounts of unlabeled image data being produced at the edge. The integration of self-supervised learning (SSL) and federated learning (FL) into one coherent system can potentially offer data privacy guarantees while also advancing the quality and robustness of the learned visual representations without needing to move data around. However, client bias and divergence during FL aggregation caused by data heterogeneity limits the performance of learned visual representations on downstream tasks. In this paper, we propose a new aggregation strategy termed Layer-wise Divergence Aware Weight Aggregation (L-DAWA) to mitigate the influence of client bias and divergence during FL aggregation. The proposed method aggregates weights at the layer-level according to the measure of angular divergence between the clients' model and the global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yasar-rehman/L-DAWA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsAttentive Walk-Aggregating Graph Neural Network