Learning the Hypotheses Space from data Part II: Convergence and Feasibility
Diego Marcondes, Adilson Simonis, Junior Barrera

TL;DR
This paper extends model selection theory by demonstrating that data-driven learning of hypotheses spaces converges to an optimal space, improving generalization and avoiding overfitting in complex models.
Contribution
It introduces a framework extending Vapnik-Chervonenkis theory to random hypotheses spaces learned from data, proving their convergence and efficiency.
Findings
Random hypotheses spaces learned from data converge to target spaces
Learning from data reduces generalization error compared to fixed spaces
The framework ensures asymptotic unbiasedness in model estimation
Abstract
In part \textit{I} we proposed a structure for a general Hypotheses Space , the Learning Space , which can be employed to avoid \textit{overfitting} when estimating in a complex space with relative shortage of examples. Also, we presented the U-curve property, which can be taken advantage of in order to select a Hypotheses Space without exhaustively searching . In this paper, we carry further our agenda, by showing the consistency of a model selection framework based on Learning Spaces, in which one selects from data the Hypotheses Space on which to learn. The method developed in this paper adds to the state-of-the-art in model selection, by extending Vapnik-Chervonenkis Theory to \textit{random} Hypotheses Spaces, i.e., Hypotheses Spaces learned from data. In this framework, one estimates a random subspace…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms · Machine Learning and Data Classification
