FedZeN: Towards superlinear zeroth-order federated learning via incremental Hessian estimation

Alessio Maritan; Subhrakanti Dey; Luca Schenato

arXiv:2309.17174 · cs.LG · October 2, 2023 · 1 citation

TL;DR

FedZeN is a federated zeroth-order optimization algorithm that incrementally estimates the Hessian of the global objective, aiming for superlinear convergence on convex problems while remaining communication-efficient and privacy-preserving.

Contribution

This work designs the first federated zeroth-order algorithm that estimates the curvature of the global objective, using an incremental Hessian estimator to pursue superlinear convergence in convex optimization.

Findings

01. Achieves local quadratic convergence with high probability.

02. Demonstrates global linear convergence up to zeroth-order precision.

03. Outperforms existing federated zeroth-order methods in simulations.

Abstract

Federated learning is a distributed learning framework that allows a set of clients to collaboratively train a model under the orchestration of a central server, without sharing raw data samples. Although in many practical scenarios the derivatives of the objective function are not available, only a few works have considered the federated zeroth-order setting, in which functions can only be accessed through a budgeted number of point evaluations. In this work we focus on convex optimization and design the first federated zeroth-order algorithm to estimate the curvature of the global objective, with the purpose of achieving superlinear convergence. We take an incremental Hessian estimator whose error norm converges linearly, and we adapt it to the federated zeroth-order setting, sampling the random search directions from the Stiefel manifold for improved performance. In particular, both…
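
The estimator idea described in the abstract can be illustrated with a minimal sketch. It assumes a rank-one update of the form H ← H + (c − uᵀHu) uuᵀ, where c is a central finite-difference estimate of the curvature uᵀ∇²f(x)u, and it draws orthonormal search directions by QR-factorizing a Gaussian matrix. The function names, the step size `mu`, and the single callable `f` (standing in for the global objective) are illustrative assumptions and not the paper's exact algorithm.

```python
import numpy as np


def sample_stiefel(d, m, rng):
    """Draw m orthonormal directions in R^d (a point on the Stiefel manifold)."""
    gauss = rng.standard_normal((d, m))
    q, r = np.linalg.qr(gauss)  # reduced QR: q is d x m with orthonormal columns
    # Fix column signs so the result does not depend on the QR sign convention.
    return q * np.sign(np.diag(r))


def curvature_along(f, x, u, mu):
    """Central second difference, approximating u^T (Hessian of f at x) u."""
    return (f(x + mu * u) - 2.0 * f(x) + f(x - mu * u)) / mu**2


def incremental_hessian_step(f, x, h_est, directions, mu=1e-3):
    """Apply one rank-one correction per direction (assumed update form).

    After each correction, u^T h_est u equals the measured curvature along u.
    Only function evaluations of f are used, no derivatives.
    """
    for u in directions.T:  # iterate over the columns of `directions`
        gap = curvature_along(f, x, u, mu) - u @ h_est @ u
        h_est = h_est + gap * np.outer(u, u)
    return h_est
```

On a quadratic objective the second difference is exact up to rounding, so repeated calls with fresh directions drive the estimate toward the true Hessian. For a general convex function the attainable accuracy is limited by the choice of `mu`.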

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning

Methods: Random Search · Focus