ReBoot: Distributed statistical learning via refitting bootstrap samples

Yumeng Wang; Ziwei Zhu; Xuming He

arXiv:2207.09098·stat.ME·May 8, 2024

ReBoot: Distributed statistical learning via refitting bootstrap samples

Yumeng Wang, Ziwei Zhu, Xuming He

PDF

Open Access

TL;DR

ReBoot is a one-shot distributed learning algorithm that refits models on bootstrap samples, achieving full-sample statistical rates with minimal communication, and is extended to federated CNN training with superior early-round performance.

Contribution

This paper introduces ReBoot, a novel distributed learning method that refits bootstrap samples with only one communication round, and extends it to federated CNNs with FedReBoot.

Findings

01

ReBoot achieves the full-sample statistical rate for GLM and phase retrieval.

02

ReBoot's bias rate is sharper than model averaging, indicating higher data split tolerance.

03

FedReBoot outperforms FedAvg in early communication rounds for image classification.

Abstract

In this paper, we propose a one-shot distributed learning algorithm via refitting bootstrap samples, which we refer to as ReBoot. ReBoot refits a new model to mini-batches of bootstrap samples that are continuously drawn from each of the locally fitted models. It requires only one round of communication of model parameters without much memory. Theoretically, we analyze the statistical error rate of ReBoot for generalized linear models (GLM) and noisy phase retrieval, which represent convex and non-convex problems, respectively. In both cases, ReBoot provably achieves the full-sample statistical rate. In particular, we show that the systematic bias of ReBoot, the error that is independent of the number of subsamples (i.e., the number of sites), is $O (n^{- 2})$ in GLM, where $n$ is the subsample size (the sample size of each local site). This rate is sharper than that of model parameter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques · Machine Learning and ELM