Optimisation of federated learning settings under statistical   heterogeneity variations

Basem Suleiman; Muhammad Johan Alibasa; Rizka Widyarini Purwanto,; Lewis Jeffries; Ali Anaissi; Jacky Song

arXiv:2406.06340·cs.LG·June 11, 2024

Optimisation of federated learning settings under statistical heterogeneity variations

Basem Suleiman, Muhammad Johan Alibasa, Rizka Widyarini Purwanto,, Lewis Jeffries, Ali Anaissi, Jacky Song

PDF

Open Access

TL;DR

This paper empirically analyzes how federated learning performance varies with statistical heterogeneity, proposing strategies and guidelines to optimize model training across diverse data distributions.

Contribution

It introduces a systematic data partitioning method, a heterogeneity metric, and provides empirical guidelines for selecting FL parameters and aggregators based on data characteristics.

Findings

01

Optimal FL parameters vary with data heterogeneity levels.

02

Certain aggregators perform better under specific heterogeneity conditions.

03

Guidelines improve FL model performance across diverse datasets.

Abstract

Federated Learning (FL) enables local devices to collaboratively learn a shared predictive model by only periodically sharing model parameters with a central aggregator. However, FL can be disadvantaged by statistical heterogeneity produced by the diversity in each local devices data distribution, which creates different levels of Independent and Identically Distributed (IID) data. Furthermore, this can be more complex when optimising different combinations of FL parameters and choosing optimal aggregation. In this paper, we present an empirical analysis of different FL training parameters and aggregators over various levels of statistical heterogeneity on three datasets. We propose a systematic data partition strategy to simulate different levels of statistical heterogeneity and a metric to measure the level of IID. Additionally, we empirically identify the best FL model and key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Privacy-Preserving Technologies in Data