Neural Collapse Inspired Federated Learning with Non-iid Data
Chenxi Huang, Liang Xie, Yibo Yang, Wenxiao Wang, Binbin, Lin, Deng Cai

TL;DR
This paper introduces a federated learning method inspired by neural collapse, using a fixed global structure and category memory vectors to improve performance and convergence on non-iid data.
Contribution
It proposes a novel approach that leverages neural collapse principles and fixed global structures to enhance federated learning with non-iid data.
Findings
Improved accuracy on heterogeneous datasets
Faster convergence compared to baseline methods
Effective handling of non-iid data distributions
Abstract
One of the challenges in federated learning is the non-independent and identically distributed (non-iid) characteristics between heterogeneous devices, which cause significant differences in local updates and affect the performance of the central server. Although many studies have been proposed to address this challenge, they only focus on local training and aggregation processes to smooth the changes and fail to achieve high performance with deep learning models. Inspired by the phenomenon of neural collapse, we force each client to be optimized toward an optimal global structure for classification. Specifically, we initialize it as a random simplex Equiangular Tight Frame (ETF) and fix it as the unit optimization target of all clients during the local updating. After guaranteeing all clients are learning to converge to the global optimum, we propose to add a global memory vector for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Stochastic Gradient Optimization Techniques
Methodsfail · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
