Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing

Adel Javanmard; Rudrajit Das; Alessandro Epasto; Vahab Mirrokni

arXiv:2505.15195·cs.LG·May 22, 2025

Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing

Adel Javanmard, Rudrajit Das, Alessandro Epasto, Vahab Mirrokni

PDF

Open Access

TL;DR

This paper develops a theoretically optimal retraining framework for binary classification using approximate message passing, demonstrating how to best combine model predictions and labels to minimize error, especially under high noise.

Contribution

It introduces a principled AMP-based analysis to derive the Bayes optimal aggregator for iterative retraining, advancing understanding of optimal model self-boosting strategies.

Findings

01

Derives the Bayes optimal aggregator function for retraining.

02

Quantifies performance improvements over multiple retraining rounds.

03

Proposes a practical version of the optimal aggregator for high noise scenarios.

Abstract

Retraining a model using its own predictions together with the original, potentially noisy labels is a well-known strategy for improving the model performance. While prior works have demonstrated the benefits of specific heuristic retraining schemes, the question of how to optimally combine the model's predictions and the provided labels remains largely open. This paper addresses this fundamental question for binary classification tasks. We develop a principled framework based on approximate message passing (AMP) to analyze iterative retraining procedures for two ground truth settings: Gaussian mixture model (GMM) and generalized linear model (GLM). Our main contribution is the derivation of the Bayes optimal aggregator function to combine the current model's predictions and the given labels, which when used to retrain the same model, minimizes its prediction error. We also quantify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance · DNA and Biological Computing · Interconnection Networks and Systems