Hyperplane bounds for neural feature mappings

Antonio Jimeno Yepes

arXiv:2201.05799·cs.LG·January 19, 2022

TL;DR

This paper investigates how feature mappings learnt by a neural network can be optimized to reduce the effective VC-dimension of the separating hyperplane in the mapped feature space, with the aim of improving generalization, especially when the training set is small.

Contribution

It introduces a method to define a loss that controls the VC-dimension of the separating hyperplane in neural feature mappings.
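To make "controlling the VC-dimension of the separating hyperplane" concrete, here is a minimal sketch of the classical margin-based bound for hyperplanes, h <= min(ceil(R^2 * A^2), d) + 1, where R is the radius of a ball containing the mapped points, A bounds the weight norm of a canonical hyperplane (so the margin is at least 1/A), and d is the feature dimension. This is the textbook bound, shown on the assumption that it is the kind of quantity such a loss would act on; it is not taken from the paper or its repository, and the function name `margin_vc_bound` is hypothetical.

```python
import math


def margin_vc_bound(radius: float, w_norm: float, dim: int) -> int:
    """Classical upper bound on the VC-dimension of margin hyperplanes.

    radius: radius R of a ball enclosing the mapped feature vectors.
    w_norm: bound A on ||w|| for a canonical hyperplane (margin >= 1 / A).
    dim:    dimension d of the feature space produced by the mapping.

    Illustrative helper only; not the loss defined in the paper.
    """
    if radius < 0 or w_norm < 0:
        raise ValueError("radius and w_norm must be non-negative")
    if dim < 1:
        raise ValueError("dim must be at least 1")
    # min(ceil(R^2 * A^2), d) + 1: a small radius or a wide margin
    # can push the bound below the dimension-based value d + 1.
    return min(math.ceil((radius * w_norm) ** 2), dim) + 1


if __name__ == "__main__":
    # Compact features with a wide margin: the margin term dominates.
    print(margin_vc_bound(radius=2.0, w_norm=1.5, dim=128))
    # Spread-out features with a narrow margin: the bound falls back to d + 1.
    print(margin_vc_bound(radius=10.0, w_norm=5.0, dim=128))
```

Under this bound, shrinking the radius of the mapped data or the norm of the hyperplane weights lowers the capacity estimate without changing the dimension of the feature space.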

Findings

01. Performance improves with small training sets using the proposed method.

02. The approach effectively reduces the VC-dimension of the hyperplane.

03. The method offers a new way to enhance neural network generalization.

Abstract

Deep learning methods minimise the empirical risk using loss functions such as the cross-entropy loss. When minimising the empirical risk, the generalisation of the learnt function still depends on the performance on the training data, the Vapnik-Chervonenkis (VC) dimension of the function and the number of training examples. Neural networks have a large number of parameters, which correlates with a VC-dimension that is typically large but not infinite, and a large number of training instances is typically needed to train them effectively. In this work, we explore how to optimize feature mappings using neural networks with the intention of reducing the effective VC-dimension of the hyperplane found in the space generated by the mapping. An interpretation of the results of this study is that it is possible to define a loss that controls the VC-dimension of the separating hyperplane.…
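The dependence on training performance, VC-dimension and sample size mentioned in the abstract is commonly expressed through Vapnik's bound: with probability at least 1 - eta, the true risk is at most the empirical risk plus sqrt((h * (ln(2n / h) + 1) - ln(eta / 4)) / n), where h is the VC-dimension and n the number of training examples. The sketch below evaluates that standard expression; the function names `vc_confidence` and `risk_bound` are hypothetical, and the bound is the textbook form, which may differ from the exact statement used in the paper.

```python
import math


def vc_confidence(h: int, n: int, eta: float = 0.05) -> float:
    """Capacity term of Vapnik's bound for VC-dimension h and n examples.

    Holds with probability at least 1 - eta. Illustrative helper only.
    """
    if not 0 < h <= n:
        raise ValueError("require 0 < h <= n")
    if not 0 < eta < 1:
        raise ValueError("require 0 < eta < 1")
    return math.sqrt((h * (math.log(2 * n / h) + 1) - math.log(eta / 4)) / n)


def risk_bound(train_error: float, h: int, n: int, eta: float = 0.05) -> float:
    """Upper bound on the true risk: empirical risk plus the capacity term."""
    return train_error + vc_confidence(h, n, eta)


if __name__ == "__main__":
    n = 1000  # a small training set
    for h in (10, 100, 500):
        # With n fixed, a lower VC-dimension gives a tighter bound.
        print(h, round(risk_bound(0.05, h, n), 3))
```

With n fixed, the capacity term grows with h, which is why lowering the effective VC-dimension matters most when few training examples are available.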

Code & Models

Repositories

ajjimeno/nn-hyperplane-bounds (PyTorch, official)

Taxonomy

Topics: Machine Learning and Data Classification · Neural Networks and Applications · Explainable Artificial Intelligence (XAI)