High-Dimensional Analysis of Bootstrap Ensemble Classifiers

Malik Tiomoko; Hamza Cherkaoui; Mohamed El Amine Seddik; Cosme Louart; Ekkehard Schnoor; Balazs Kegl

arXiv:2505.14587·stat.ML·May 14, 2026

High-Dimensional Analysis of Bootstrap Ensemble Classifiers

Malik Tiomoko, Hamza Cherkaoui, Mohamed El Amine Seddik, Cosme Louart, Ekkehard Schnoor, Balazs Kegl

PDF

TL;DR

This paper provides a theoretical analysis of bootstrap ensemble classifiers, specifically LSSVM, in high-dimensional settings, using Random Matrix Theory to guide parameter selection and validate with experiments.

Contribution

It offers new theoretical insights into bootstrap methods for high-dimensional LSSVM ensembles and proposes strategies for optimal parameter choices.

Findings

01

Bootstrap methods improve LSSVM performance in high dimensions.

02

Theoretical guidelines for selecting number of subsets and regularization.

03

Empirical validation confirms theoretical predictions.

Abstract

Bootstrap methods have long been the cornerstone of ensemble learning in machine learning. This paper presents a theoretical analysis of bootstrap techniques applied to the Least Square Support Vector Machine (LSSVM) ensemble in the context of large and growing sample sizes and feature dimensionalities. Using tools from Random Matrix Theory, we investigate the performance of this classifier that aggregates decision functions from multiple weak classifiers, each trained on different subsets of the data. We provide insights into the use of bootstrap methods in high-dimensional settings, enhancing our understanding of their impact. Based on these findings, we propose strategies to select the number of subsets and the regularization parameter that maximize the performance of the LSSVM. Empirical experiments on synthetic and real-world datasets validate our theoretical results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.