Monotonic Learning in the PAC Framework: A New Perspective
Ming Li, Chenyi Zhang, Qin Li

TL;DR
This paper introduces a PAC-based theoretical framework demonstrating that certain learning algorithms exhibit monotonic improvement in generalization error as training data increases, under specific conditions.
Contribution
It provides a novel theoretical risk distribution showing monotonicity in learning performance and identifies conditions for ERM algorithms to be monotone.
Findings
Monotonicity holds when hypothesis space is finite or has finite VC-dimension.
Theoretical risk bounds converge monotonically to zero with increasing data.
Experimental validation confirms theoretical predictions.
Abstract
Monotone learning describes learning processes in which expected performance consistently improves as the amount of training data increases. However, recent studies challenge this conventional wisdom, revealing significant gaps in the understanding of generalization in machine learning. Addressing these gaps is crucial for advancing the theoretical foundations of the field. In this work, we utilize Probably Approximately Correct (PAC) learning theory to construct a theoretical risk distribution that approximates a learning algorithm's actual performance. We rigorously prove that this theoretical distribution exhibits monotonicity as sample sizes increase. We identify two scenarios under which deterministic algorithms based on Empirical Risk Minimization (ERM) are monotone: (1) the hypothesis space is finite, or (2) the hypothesis space has finite VC-dimension. Experiments on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Face and Expression Recognition
