Learning the Hypotheses Space from data Part II: Convergence and   Feasibility

Diego Marcondes; Adilson Simonis; Junior Barrera

arXiv:2001.11578·stat.ML·September 13, 2021·1 cites

Learning the Hypotheses Space from data Part II: Convergence and Feasibility

Diego Marcondes, Adilson Simonis, Junior Barrera

PDF

Open Access

TL;DR

This paper extends model selection theory by demonstrating that data-driven learning of hypotheses spaces converges to an optimal space, improving generalization and avoiding overfitting in complex models.

Contribution

It introduces a framework extending Vapnik-Chervonenkis theory to random hypotheses spaces learned from data, proving their convergence and efficiency.

Findings

01

Random hypotheses spaces learned from data converge to target spaces

02

Learning from data reduces generalization error compared to fixed spaces

03

The framework ensures asymptotic unbiasedness in model estimation

Abstract

In part \textit{I} we proposed a structure for a general Hypotheses Space $H$ , the Learning Space $L (H)$ , which can be employed to avoid \textit{overfitting} when estimating in a complex space with relative shortage of examples. Also, we presented the U-curve property, which can be taken advantage of in order to select a Hypotheses Space without exhaustively searching $L (H)$ . In this paper, we carry further our agenda, by showing the consistency of a model selection framework based on Learning Spaces, in which one selects from data the Hypotheses Space on which to learn. The method developed in this paper adds to the state-of-the-art in model selection, by extending Vapnik-Chervonenkis Theory to \textit{random} Hypotheses Spaces, i.e., Hypotheses Spaces learned from data. In this framework, one estimates a random subspace…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Algorithms · Machine Learning and Data Classification