TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Yaodong Yu, Alexander Wei, Sai Praneeth Karimireddy, Yi Ma, and Michael I. Jordan

arXiv:2207.06343·cs.LG·October 6, 2022·5 cites

TL;DR

This paper introduces TCT (Train-Convexify-Train), a method that improves federated learning when clients have dissimilar data distributions. It convexifies the optimization problem using neural tangent kernels, yielding accuracy gains of up to +36% on FMNIST and +37% on CIFAR10.

Contribution

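The paper proposes the Train-Convexify-Train (TCT) procedure, which first learns features with an off-the-shelf federated method (e.g., FedAvg) and then runs a convexified optimization based on neural tangent kernels, sidestepping the nonconvexity that distorts the final layers in federated training.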
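To make the convexification idea concrete, here is a minimal sketch, assuming a toy two-parameter model and a plain squared loss. It is not the authors' implementation and omits the federated part entirely. The names `model`, `ntk_features`, `linearized`, and `fit_linearized` are hypothetical, and `w0` merely stands in for weights produced by a first training stage. The sketch shows that a model linearized around fixed weights is linear in its new parameters, so fitting it is a convex problem.

```python
import math

def model(x, w):
    # Toy two-parameter network: f(x; w) = w[1] * tanh(w[0] * x).
    return w[1] * math.tanh(w[0] * x)

def ntk_features(x, w0):
    # Gradient of the model output with respect to the parameters,
    # evaluated at the fixed weights w0 (empirical tangent features).
    t = math.tanh(w0[0] * x)
    return [w0[1] * (1.0 - t * t) * x, t]

def linearized(x, theta, w0):
    # First-order Taylor expansion of the model around w0.
    # This function is linear (affine) in theta.
    phi = ntk_features(x, w0)
    return model(x, w0) + sum(p * t for p, t in zip(phi, theta))

def mse(theta, data, w0):
    # Mean squared error of the linearized model; convex in theta.
    return sum((linearized(x, theta, w0) - y) ** 2 for x, y in data) / len(data)

def fit_linearized(data, w0, lr=0.1, steps=500):
    # Plain gradient descent on the convex objective, starting at theta = 0.
    theta = [0.0, 0.0]
    for _ in range(steps):
        grad = [0.0, 0.0]
        for x, y in data:
            phi = ntk_features(x, w0)
            residual = linearized(x, theta, w0) - y
            for i in range(2):
                grad[i] += 2.0 * residual * phi[i] / len(data)
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta

# Assumed stand-in for weights learned in a first stage (illustrative values).
w0 = [0.5, 1.0]
# Made-up toy data, used only to exercise the code.
data = [(-2.0, -1.5), (-1.0, -0.9), (0.5, 0.4), (1.0, 0.8), (2.0, 1.6)]
theta = fit_linearized(data, w0)
```

Because the objective is a convex quadratic in `theta`, averaging or otherwise combining updates from several optimizers cannot land in a worse basin the way it can for the original nonconvex network; that is the property the convexified stage relies on.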

Findings

01 Improves accuracy by up to +36% on FMNIST

02 Improves accuracy by up to +37% on CIFAR10

03 Significantly improves federated learning accuracy when clients have dissimilar data distributions

Abstract

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is simultaneously performant for all clients, current federated optimization methods fail to converge to a comparable solution. We show that this performance disparity can largely be attributed to optimization challenges presented by nonconvexity. Specifically, we find that the early layers of the network do learn useful features, but the final layers fail to make use of them. That is, federated optimization applied to this non-convex problem distorts the learning of the final layers. Leveraging this observation, we propose a Train-Convexify-Train (TCT) procedure to sidestep this issue: first, learn features using off-the-shelf methods (e.g., FedAvg); then,…

Code & Models

Repositories

yaodongyu/tct
PyTorch · Official

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset · 42 dl

Videos

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels · SlidesLive

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques

Methods: Stochastic Gradient Descent