When Foresight Pruning Meets Zeroth-Order Optimization: Efficient Federated Learning for Low-Memory Devices
Pengyu Zhang, Yingjie Liu, Yingbo Zhou, Xiao Du, Xian Wei, Ting Wang,, Mingsong Chen

TL;DR
This paper introduces a federated foresight pruning method based on Neural Tangent Kernel (NTK) that reduces memory usage and computational overhead in federated learning, making it feasible for low-memory AIoT devices.
Contribution
It proposes a novel NTK-based federated foresight pruning approach that integrates with BP-Free training to significantly lower memory and FLOPs while maintaining model performance.
Findings
Memory reduction up to 9x achieved.
Significant decrease in FLOPs with performance boost.
Effective in both simulation and real test-bed environments.
Abstract
Although Federated Learning (FL) enables collaborative learning in Artificial Intelligence of Things (AIoT) design, it fails to work on low-memory AIoT devices due to its heavy memory usage. To address this problem, various federated pruning methods are proposed to reduce memory usage during inference. However, few of them can substantially mitigate the memory burdens during pruning and training. As an alternative, zeroth-order or backpropagation-free (BP-Free) methods can partially alleviate the memory consumption, but they suffer from scaling up and large computation overheads, since the gradient estimation error and floating point operations (FLOPs) increase as the dimensionality of the model parameters grows. In this paper, we propose a federated foresight pruning method based on Neural Tangent Kernel (NTK), which can seamlessly integrate with federated BP-Free training frameworks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
MethodsNeural Tangent Kernel · Pruning
