Analysis of ensemble learning using simple perceptrons based on online learning theory
Seiji Miyoshi, Kazuyuki Hara, Masato Okada

TL;DR
This paper analyzes the generalization error of ensemble learning with simple perceptrons using online learning theory and statistical mechanics, deriving differential equations for key parameters and comparing three learning rules.
Contribution
It introduces a theoretical framework to calculate ensemble generalization error using order parameters and derives differential equations for various learning rules.
Findings
AdaTron learning outperforms Hebbian and perceptron learning in maintaining student diversity.
Differential equations describe the dynamics of ensemble learning behavior.
Analytical results reveal different characteristics of learning rules in ensemble contexts.
Abstract
Ensemble learning of nonlinear perceptrons, which determine their outputs by sign functions, is discussed within the framework of online learning and statistical mechanics. One purpose of statistical learning theory is to theoretically obtain the generalization error. This paper shows that ensemble generalization error can be calculated by using two order parameters, that is, the similarity between a teacher and a student, and the similarity among students. The differential equations that describe the dynamical behaviors of these order parameters are derived in the case of general learning rules. The concrete forms of these differential equations are derived analytically in the cases of three well-known rules: Hebbian learning, perceptron learning and AdaTron learning. Ensemble generalization errors of these three rules are calculated by using the results determined by solving their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
