Comfetch: Federated Learning of Large Networks on Constrained Clients   via Sketching

Tahseen Rabbani; Brandon Feng; Marco Bornstein; Kyle Rui Sang; Yifan; Yang; Arjun Rajkumar; Amitabh Varshney; Furong Huang

arXiv:2109.08346·cs.LG·October 3, 2023·1 cites

Comfetch: Federated Learning of Large Networks on Constrained Clients via Sketching

Tahseen Rabbani, Brandon Feng, Marco Bornstein, Kyle Rui Sang, Yifan, Yang, Arjun Rajkumar, Amitabh Varshney, Furong Huang

PDF

Open Access

TL;DR

Comfetch enables federated learning of large neural networks on resource-constrained clients by using sketching techniques to reduce communication and computation costs, while maintaining competitive accuracy.

Contribution

Introduces Comfetch, a novel algorithm that employs count sketching to facilitate training of large models in federated settings with limited client resources.

Findings

01

Achieves competitive accuracy on CIFAR10/100 datasets.

02

Reduces communication and memory costs for clients.

03

Provides convergence guarantees for nonconvex training.

Abstract

Federated learning (FL) is a popular paradigm for private and collaborative model training on the edge. In centralized FL, the parameters of a global architecture (such as a deep neural network) are maintained and distributed by a central server/controller to clients who transmit model updates (gradients) back to the server based on local optimization. While many efforts have focused on reducing the communication complexity of gradient transmission, the vast majority of compression-based algorithms assume that each participating client is able to download and train the current and full set of parameters, which may not be a practical assumption depending on the resource constraints of smaller clients such as mobile devices. In this work, we propose a simple yet effective novel algorithm, Comfetch, which allows clients to train large networks using reduced representations of the global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning

MethodsTest · Tanh Activation · Sigmoid Activation · Long Short-Term Memory