CatFedAvg: Optimising Communication-efficiency and Classification Accuracy in Federated Learning
Dipankar Sarkar, Sumit Rai, Ankur Narang

TL;DR
CatFedAvg is a federated learning algorithm that enhances communication efficiency and model accuracy by using category coverage maximization and client data structure meta-data collection, outperforming FedAvg especially in imbalanced data scenarios.
Contribution
The paper introduces CatFedAvg, a unified federated learning approach that simultaneously improves communication efficiency and learning quality through category coverage and data structure meta-data strategies.
Findings
Achieves 10% higher accuracy on MNIST with 70% less network transfer.
Performs better than FedAvg under extreme data imbalance conditions.
Demonstrates robustness across multiple vision classification datasets.
Abstract
Federated learning has allowed the training of statistical models over remote devices without the transfer of raw client data. In practice, training in heterogeneous and large networks introduce novel challenges in various aspects like network load, quality of client data, security and privacy. Recent works in FL have worked on improving communication efficiency and addressing uneven client data distribution independently, but none have provided a unified solution for both challenges. We introduce a new family of Federated Learning algorithms called CatFedAvg which not only improves the communication efficiency but improves the quality of learning using a category coverage maximization strategy. We use the FedAvg framework and introduce a simple and efficient step every epoch to collect meta-data about the client's training data structure which the central server uses to request a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Traffic Prediction and Management Techniques
