Fractional-Order Federated Learning
Mohammad Partohaghighi, Roummel Marcia, YangQuan Chen

TL;DR
This paper introduces FOFedAvg, a fractional-order federated learning algorithm that enhances convergence speed and robustness by incorporating memory-aware fractional updates, proven to converge under standard assumptions.
Contribution
It proposes a novel fractional-order federated averaging method that improves communication efficiency and convergence in non-IID data settings, with theoretical convergence guarantees.
Findings
FOFedAvg outperforms baseline algorithms on multiple benchmark datasets.
Fractional-order updates improve communication efficiency and convergence speed.
Theoretical proof of convergence under standard assumptions.
Abstract
Federated learning (FL) allows remote clients to train a global model collaboratively while protecting client privacy. Despite its privacy-preserving benefits, FL has significant drawbacks, including slow convergence, high communication cost, and non-independent-and-identically-distributed (non-IID) data. In this work, we present a novel FedAvg variation called Fractional-Order Federated Averaging (FOFedAvg), which incorporates Fractional-Order Stochastic Gradient Descent (FOSGD) to capture long-range relationships and deeper historical information. By introducing memory-aware fractional-order updates, FOFedAvg improves communication efficiency and accelerates convergence while mitigating instability caused by heterogeneous, non-IID client data. We compare FOFedAvg against a broad set of established federated optimization algorithms on benchmark datasets including MNIST, FEMNIST,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Data and IoT Technologies · Stochastic Gradient Optimization Techniques
