Comfetch: Federated Learning of Large Networks on Constrained Clients via Sketching
Tahseen Rabbani, Brandon Feng, Marco Bornstein, Kyle Rui Sang, Yifan, Yang, Arjun Rajkumar, Amitabh Varshney, Furong Huang

TL;DR
Comfetch enables federated learning of large neural networks on resource-constrained clients by using sketching techniques to reduce communication and computation costs, while maintaining competitive accuracy.
Contribution
Introduces Comfetch, a novel algorithm that employs count sketching to facilitate training of large models in federated settings with limited client resources.
Findings
Achieves competitive accuracy on CIFAR10/100 datasets.
Reduces communication and memory costs for clients.
Provides convergence guarantees for nonconvex training.
Abstract
Federated learning (FL) is a popular paradigm for private and collaborative model training on the edge. In centralized FL, the parameters of a global architecture (such as a deep neural network) are maintained and distributed by a central server/controller to clients who transmit model updates (gradients) back to the server based on local optimization. While many efforts have focused on reducing the communication complexity of gradient transmission, the vast majority of compression-based algorithms assume that each participating client is able to download and train the current and full set of parameters, which may not be a practical assumption depending on the resource constraints of smaller clients such as mobile devices. In this work, we propose a simple yet effective novel algorithm, Comfetch, which allows clients to train large networks using reduced representations of the global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
MethodsTest · Tanh Activation · Sigmoid Activation · Long Short-Term Memory
