Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
Long Dang, Thushari Hapuarachchi, Kaiqi Xiong, Jing Lin

TL;DR
This paper investigates how different activation functions and data distribution types affect the robustness of machine learning models under adversarial training, proposing methods to improve robustness especially in federated learning with non-IID data.
Contribution
It introduces an advanced adversarial training approach for centralized and federated environments, evaluates ten activation functions, and demonstrates data sharing benefits for robustness in non-IID federated data.
Findings
ReLU generally performs best among activation functions.
Robust accuracy drops significantly in federated non-IID settings.
Data sharing improves robustness, especially with 40% data sharing.
Abstract
Adversarial training is an effective method to improve the machine learning (ML) model robustness. Most existing studies typically consider the Rectified linear unit (ReLU) activation function and centralized training environments. In this paper, we study the ML model robustness using ten different activation functions through adversarial training in centralized environments and explore the ML model robustness in federal learning environments. In the centralized environment, we first propose an advanced adversarial training approach to improving the ML model robustness by incorporating model architecture change, soft labeling, simplified data augmentation, and varying learning rates. Then, we conduct extensive experiments on ten well-known activation functions in addition to ReLU to better understand how they impact the ML model robustness. Furthermore, we extend the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
