SplitGP: Achieving Both Generalization and Personalization in Federated   Learning

Dong-Jun Han; Do-Yeon Kim; Minseok Choi; Christopher G. Brinton,; Jaekyun Moon

arXiv:2212.08343·cs.LG·February 14, 2023·5 cites

SplitGP: Achieving Both Generalization and Personalization in Federated Learning

Dong-Jun Han, Do-Yeon Kim, Minseok Choi, Christopher G. Brinton,, Jaekyun Moon

PDF

Open Access

TL;DR

SplitGP is a split learning approach for federated learning that balances personalization and generalization, enabling efficient inference on resource-constrained edge devices by splitting the model into client and server components.

Contribution

It introduces a novel split learning framework that separately optimizes for personalization on the client side and generalization on the server side, with theoretical convergence analysis and practical bounds.

Findings

01

Outperforms baselines in inference time and accuracy

02

Balances personalization and generalization effectively

03

Provides theoretical analysis of convergence and split ratios

Abstract

A fundamental challenge to providing edge-AI services is the need for a machine learning (ML) model that achieves personalization (i.e., to individual clients) and generalization (i.e., to unseen data) properties concurrently. Existing techniques in federated learning (FL) have encountered a steep tradeoff between these objectives and impose large computational requirements on edge devices during training and inference. In this paper, we propose SplitGP, a new split learning solution that can simultaneously capture generalization and personalization capabilities for efficient inference across resource-constrained clients (e.g., mobile/IoT devices). Our key idea is to split the full ML model into client-side and server-side components, and impose different roles to them: the client-side model is trained to have strong personalization capability optimized to each client's main task, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Recommender Systems and Techniques

MethodsTest