Taming the Instability: A Robust Second-Order Optimizer for Federated Learning over Non-IID Data
Yuanqiao Zhang, Tiantian He, Yuan Gao, Yixin Wang, Yew-Soon Ong, Maoguo Gong, A.K. Qin, Hui Li

TL;DR
FedRCO is a new second-order optimization method for federated learning that enhances stability, reduces communication costs, and improves convergence in non-IID data environments.
Contribution
It introduces a robust framework with real-time anomaly detection, stability protocols, and adaptive aggregation to address instability and inefficiency in distributed second-order optimization.
Findings
FedRCO outperforms existing methods in convergence speed and accuracy.
It effectively mitigates instability caused by non-IID data.
Experiments demonstrate higher robustness and efficiency in diverse scenarios.
Abstract
In this paper, we present Federated Robust Curvature Optimization (FedRCO), a novel second-order optimization framework designed to improve convergence speed and reduce communication cost in Federated Learning systems under statistical heterogeneity. Existing second-order optimization methods are often computationally expensive and numerically unstable in distributed settings. In contrast, FedRCO addresses these challenges by integrating an efficient approximate curvature optimizer with a provable stability mechanism. Specifically, FedRCO incorporates three key components: (1) a Gradient Anomaly Monitor that detects and mitigates exploding gradients in real-time, (2) a Fail-Safe Resilience protocol that resets optimization states upon numerical instability, and (3) a Curvature-Preserving Adaptive Aggregation strategy that safely integrates global knowledge without erasing the local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
