Monotonic Learning in the PAC Framework: A New Perspective

Ming Li; Chenyi Zhang; Qin Li

arXiv:2501.05493·cs.LG·May 22, 2025

Monotonic Learning in the PAC Framework: A New Perspective

Ming Li, Chenyi Zhang, Qin Li

PDF

Open Access

TL;DR

This paper introduces a PAC-based theoretical framework demonstrating that certain learning algorithms exhibit monotonic improvement in generalization error as training data increases, under specific conditions.

Contribution

It provides a novel theoretical risk distribution showing monotonicity in learning performance and identifies conditions for ERM algorithms to be monotone.

Findings

01

Monotonicity holds when hypothesis space is finite or has finite VC-dimension.

02

Theoretical risk bounds converge monotonically to zero with increasing data.

03

Experimental validation confirms theoretical predictions.

Abstract

Monotone learning describes learning processes in which expected performance consistently improves as the amount of training data increases. However, recent studies challenge this conventional wisdom, revealing significant gaps in the understanding of generalization in machine learning. Addressing these gaps is crucial for advancing the theoretical foundations of the field. In this work, we utilize Probably Approximately Correct (PAC) learning theory to construct a theoretical risk distribution that approximates a learning algorithm's actual performance. We rigorously prove that this theoretical distribution exhibits monotonicity as sample sizes increase. We identify two scenarios under which deterministic algorithms based on Empirical Risk Minimization (ERM) are monotone: (1) the hypothesis space is finite, or (2) the hypothesis space has finite VC-dimension. Experiments on two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Face and Expression Recognition