A Mutual Information Perspective on Federated Contrastive Learning
Christos Louizos, Matthias Reisser, Denis Korzhenkov

TL;DR
This paper explores federated contrastive learning through mutual information, proposing extensions for semi-supervised settings and analyzing the impact of data heterogeneity on performance.
Contribution
It introduces a mutual information perspective to federated contrastive learning, extending SimCLR for semi-supervised scenarios and analyzing non-i.i.d. data effects.
Findings
Adding user verification improves global mutual information bounds.
Supervised SimCLR can be achieved with label-based contrastive loss and auxiliary classification.
Global mutual information maximization benefits some non-i.i.d. data sources but not others.
Abstract
We investigate contrastive learning in the federated setting through the lens of SimCLR and multi-view mutual information maximization. In doing so, we uncover a connection between contrastive representation learning and user verification; by adding a user verification loss to each client's local SimCLR loss we recover a lower bound to the global multi-view mutual information. To accommodate for the case of when some labelled data are available at the clients, we extend our SimCLR variant to the federated semi-supervised setting. We see that a supervised SimCLR objective can be obtained with two changes: a) the contrastive loss is computed between datapoints that share the same label and b) we require an additional auxiliary head that predicts the correct labels from either of the two views. Along with the proposed SimCLR extensions, we also study how different sources of…
Peer Reviews
Decision·ICLR 2024 spotlight
- The problem of pretraining large models in a federated setting is quite important and has seen little progress so far. - The proposed LB on the global multi-view objective is principled and as the authors show amenable to federated training. - Experiments in the semi-supervised setting are a nice addition to the paper, and clearly shows that their objective can be built upon.
- The paper lacks convergence analysis of their optimization algorithm, which is quite common in FL papers. - Experiments on more challenging/heterogeneous benchmarks like ImageNet are missing. - Discussion on how their objective can be adapted to other centralized pretraining objectives is missing. (See questions) - (Minor/Nit) Proposition 2 need not be stated, it follows immediately from previous Lemmas.
- unsupervised federated representation learning is an important and interesting use-case - the method theoretically motivated and sound
- some baselines for semi-supervised learning, and some proper supervised baselines are missing. - The empirical results show that the proposed federated SimCLR variant is only en par with spectral constrastive learning when using a user verification loss. This is not properly discussed.
The extension of SimCLR to the federated setting and the exploration of MI maximization in this context is particularly given the increasing interest in FL. The paper provides a theoretical foundation for the proposed methods, including the connection between contrastive learning and user verification, and the derivation of a lower bound to the global multi-view MI. The authors conduct both unsupervised and semi-supervised experiments, providing a thorough evaluation of their proposed method.
The theoretical derivations, propositions, and lemmas mainly connect to and extend existing methods, which might be perceived as a lack of novelty. For example, the idea of decomposed MI in (1) and (2) has been presented in Sordoni et al. (2021). The proofs of propositions and lemmas are quite standard which mostly follow the existing approaches in literature. It would be more beneficial if the authors can clarify the unique aspects and advantages of their approach, and clearly differentiate it
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Face and Expression Recognition · Internet Traffic Analysis and Secure E-voting
MethodsBitcoin Customer Service Number +1-833-534-1729 · Average Pooling · Convolution · Kaiming Initialization · Max Pooling · Global Average Pooling · Dense Connections · Random Gaussian Blur · Feedforward Network · Random Resized Crop
