SplitGP: Achieving Both Generalization and Personalization in Federated Learning
Dong-Jun Han, Do-Yeon Kim, Minseok Choi, Christopher G. Brinton,, Jaekyun Moon

TL;DR
SplitGP is a split learning approach for federated learning that balances personalization and generalization, enabling efficient inference on resource-constrained edge devices by splitting the model into client and server components.
Contribution
It introduces a novel split learning framework that separately optimizes for personalization on the client side and generalization on the server side, with theoretical convergence analysis and practical bounds.
Findings
Outperforms baselines in inference time and accuracy
Balances personalization and generalization effectively
Provides theoretical analysis of convergence and split ratios
Abstract
A fundamental challenge to providing edge-AI services is the need for a machine learning (ML) model that achieves personalization (i.e., to individual clients) and generalization (i.e., to unseen data) properties concurrently. Existing techniques in federated learning (FL) have encountered a steep tradeoff between these objectives and impose large computational requirements on edge devices during training and inference. In this paper, we propose SplitGP, a new split learning solution that can simultaneously capture generalization and personalization capabilities for efficient inference across resource-constrained clients (e.g., mobile/IoT devices). Our key idea is to split the full ML model into client-side and server-side components, and impose different roles to them: the client-side model is trained to have strong personalization capability optimized to each client's main task, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Recommender Systems and Techniques
MethodsTest
