Taming the Instability: A Robust Second-Order Optimizer for Federated Learning over Non-IID Data

Yuanqiao Zhang; Tiantian He; Yuan Gao; Yixin Wang; Yew-Soon Ong; Maoguo Gong; A.K. Qin; Hui Li

arXiv:2603.28316·cs.LG·March 31, 2026

Taming the Instability: A Robust Second-Order Optimizer for Federated Learning over Non-IID Data

Yuanqiao Zhang, Tiantian He, Yuan Gao, Yixin Wang, Yew-Soon Ong, Maoguo Gong, A.K. Qin, Hui Li

PDF

TL;DR

FedRCO is a new second-order optimization method for federated learning that enhances stability, reduces communication costs, and improves convergence in non-IID data environments.

Contribution

It introduces a robust framework with real-time anomaly detection, stability protocols, and adaptive aggregation to address instability and inefficiency in distributed second-order optimization.

Findings

01

FedRCO outperforms existing methods in convergence speed and accuracy.

02

It effectively mitigates instability caused by non-IID data.

03

Experiments demonstrate higher robustness and efficiency in diverse scenarios.

Abstract

In this paper, we present Federated Robust Curvature Optimization (FedRCO), a novel second-order optimization framework designed to improve convergence speed and reduce communication cost in Federated Learning systems under statistical heterogeneity. Existing second-order optimization methods are often computationally expensive and numerically unstable in distributed settings. In contrast, FedRCO addresses these challenges by integrating an efficient approximate curvature optimizer with a provable stability mechanism. Specifically, FedRCO incorporates three key components: (1) a Gradient Anomaly Monitor that detects and mitigates exploding gradients in real-time, (2) a Fail-Safe Resilience protocol that resets optimization states upon numerical instability, and (3) a Curvature-Preserving Adaptive Aggregation strategy that safely integrates global knowledge without erasing the local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.