The Effect of Training Parameters and Mechanisms on Decentralized   Federated Learning based on MNIST Dataset

Zhuofan Zhang; Mi Zhou; Kaicheng Niu; and Chaouki Abdallah

arXiv:2108.03508·cs.LG·August 10, 2021·1 cites

The Effect of Training Parameters and Mechanisms on Decentralized Federated Learning based on MNIST Dataset

Zhuofan Zhang, Mi Zhou, Kaicheng Niu, and Chaouki Abdallah

PDF

Open Access 1 Repo

TL;DR

This paper explores how various training parameters and mechanisms affect decentralized federated learning performance on the MNIST dataset, highlighting robustness and failure modes in different setups.

Contribution

It introduces Decentralized Federated Learning (DFL) and systematically evaluates its robustness under various training modifications using the MNIST dataset.

Findings

01

Training procedures are generally robust but not optimal.

02

Large variance in model weights can cause training failures.

03

Partial sharing and non-IID data impact model convergence.

Abstract

Federated Learning is an algorithm suited for training models on decentralized data, but the requirement of a central "server" node is a bottleneck. In this document, we first introduce the notion of Decentralized Federated Learning (DFL). We then perform various experiments on different setups, such as changing model aggregation frequency, switching from independent and identically distributed (IID) dataset partitioning to non-IID partitioning with partial global sharing, using different optimization methods across clients, and breaking models into segments with partial sharing. All experiments are run on the MNIST handwritten digits dataset. We observe that those altered training procedures are generally robust, albeit non-optimal. We also observe failures in training when the variance between model weights is too large. The open-source experiment code is accessible through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhzhang2018/decentralizedfl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Stochastic Gradient Optimization Techniques