TL;DR
Flotilla is a scalable, modular federated learning framework designed for heterogeneous edge resources, supporting asynchronous aggregation, fault tolerance, and real-world deployment, outperforming existing frameworks in resource efficiency and scalability.
Contribution
Introduces Flotilla, a flexible federated learning framework supporting asynchronous strategies, fault tolerance, and real-world deployment on diverse hardware, with a modular and resource-efficient design.
Findings
Supports multiple FL strategies with modular design
Demonstrates fault tolerance with rapid failover
Achieves comparable or better resource usage and scalability than existing frameworks
Abstract
With the recent improvements in mobile and edge computing and rising concerns of data privacy, Federated Learning(FL) has rapidly gained popularity as a privacy-preserving, distributed machine learning methodology. Several FL frameworks have been built for testing novel FL strategies. However, most focus on validating the learning aspects of FL through pseudo-distributed simulation but not for deploying on real edge hardware in a distributed manner to meaningfully evaluate the federated aspects from a systems perspective. Current frameworks are also inherently not designed to support asynchronous aggregation, which is gaining popularity, and have limited resilience to client and server failures. We introduce Flotilla, a scalable and lightweight FL framework. It adopts a ``user-first'' modular design to help rapidly compose various synchronous and asynchronous FL strategies while being…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
