Single-shot Hyper-parameter Optimization for Federated Learning: A General Algorithm & Analysis

Yi Zhou; Parikshit Ram; Theodoros Salonidis; Nathalie Baracaldo; Horst Samulowitz; Heiko Ludwig

arXiv:2202.08338 · cs.LG · February 18, 2022 · 1 citation

Open Access

TL;DR

This paper introduces FLoRA, a federated learning hyper-parameter optimization (FL-HPO) framework that identifies a single set of good hyper-parameters with minimal additional communication overhead and then uses it in a single FL training run; the authors report improved model accuracy across diverse datasets.

Contribution

FLoRA is a general, single-shot FL-HPO algorithm applicable to various ML models and data types, with theoretical analysis of its optimality gap considering data heterogeneity.

Findings

1. FLoRA achieves significant accuracy improvements over baselines.

2. FLoRA maintains robustness as the number of parties increases.

3. Theoretical analysis accounts for non-IID data heterogeneity.

Abstract

We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss SuRface Aggregation (FLoRA), a general FL-HPO solution framework that can address use cases of tabular data and any Machine Learning (ML) model including gradient boosting training algorithms and therefore further expands the scope of FL-HPO. FLoRA enables single-shot FL-HPO: identifying a single set of good hyper-parameters that are subsequently used in a single FL training. Thus, it enables FL-HPO solutions with minimal additional communication overhead compared to FL training without HPO. We theoretically characterize the optimality gap of FL-HPO, which explicitly accounts for the heterogeneous non-IID nature of the parties' local data distributions, a dominant characteristic of FL systems. Our empirical evaluation of FLoRA for multiple ML…
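The abstract does not spell out how the loss surfaces are aggregated, so the following is only a minimal sketch of the single-shot idea under stated assumptions: each party is assumed to report (hyper-parameter, local validation loss) pairs once, and the aggregator is assumed to pool them into a simple nearest-neighbour surrogate and pick the candidate with the lowest predicted loss. The names `aggregate_loss_surface` and `select_single_shot_hp`, the one-dimensional hyper-parameter, and the nearest-neighbour surrogate are all hypothetical illustrations, not the paper's method.

```python
from typing import Callable, List, Tuple

# Assumption: one scalar hyper-parameter and one loss value per local trial.
Pair = Tuple[float, float]  # (hyper-parameter value, local validation loss)


def aggregate_loss_surface(party_pairs: List[List[Pair]], k: int = 3) -> Callable[[float], float]:
    """Pool every party's pairs into one surrogate loss surface (hypothetical)."""
    pooled = [pair for pairs in party_pairs for pair in pairs]

    def surrogate(hp: float) -> float:
        # Predicted loss = mean loss of the k pooled trials closest to hp.
        nearest = sorted(pooled, key=lambda p: abs(p[0] - hp))[:k]
        return sum(loss for _, loss in nearest) / len(nearest)

    return surrogate


def select_single_shot_hp(party_pairs: List[List[Pair]], candidates: List[float], k: int = 3) -> float:
    """Return the candidate with the lowest surrogate loss (hypothetical)."""
    surrogate = aggregate_loss_surface(party_pairs, k)
    return min(candidates, key=surrogate)


# Illustrative, made-up numbers: two parties, three learning rates each.
party_a = [(0.01, 0.9), (0.1, 0.4), (1.0, 0.8)]
party_b = [(0.01, 0.7), (0.1, 0.5), (1.0, 1.0)]
best_hp = select_single_shot_hp([party_a, party_b], [0.01, 0.1, 1.0], k=2)
```

In this sketch the parties communicate once (their pairs), and `best_hp` would then be used for a single federated training run, which is the sense in which the extra communication stays small.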

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning

Methods: Hyper-parameter optimization