Towards Non-I.I.D. and Invisible Data with FedNAS: Federated Deep Learning via Neural Architecture Search
Chaoyang He, Murali Annavaram, Salman Avestimehr

TL;DR
This paper introduces FedNAS, a neural architecture search method for federated learning that automatically finds better model architectures for non-IID data, improving accuracy over predefined models.
Contribution
The paper proposes FedNAS, a novel federated NAS algorithm that automates architecture search in FL, addressing non-IID data challenges and outperforming manual designs.
Findings
FedNAS outperforms predefined architectures on non-IID datasets.
Automated architecture search improves model accuracy in federated settings.
System implementation demonstrates practical viability of FedNAS.
Abstract
Federated Learning (FL) has been proved to be an effective learning framework when data cannot be centralized due to privacy, communication costs, and regulatory restrictions. When training deep learning models under an FL setting, people employ the predefined model architecture discovered in the centralized environment. However, this predefined architecture may not be the optimal choice because it may not fit data with non-identical and independent distribution (non-IID). Thus, we advocate automating federated learning (AutoFL) to improve model accuracy and reduce the manual design effort. We specifically study AutoFL via Neural Architecture Search (NAS), which can automate the design process. We propose a Federated NAS (FedNAS) algorithm to help scattered workers collaboratively searching for a better architecture with higher accuracy. We also build a system based on FedNAS. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
