Manifold Regularized Discriminative Neural Networks
Shuangfei Zhai, Zhongfei Zhang

TL;DR
This paper introduces manifold regularization techniques for discriminative neural networks, improving generalization by modeling input data distribution and leveraging the manifold hypothesis, applicable to both supervised and semi-supervised learning.
Contribution
It proposes two novel regularizers that incorporate the data manifold into DNN training, enhancing generalization especially in semi-supervised settings.
Findings
Significant performance improvements on MNIST, CIFAR10, and SVHN datasets.
Effective semi-supervised learning with label-independent regularization.
Enhanced generalization by modeling data distribution within discriminative frameworks.
Abstract
Unregularized deep neural networks (DNNs) can be easily overfit with a limited sample size. We argue that this is mostly due to the disriminative nature of DNNs which directly model the conditional probability (or score) of labels given the input. The ignorance of input distribution makes DNNs difficult to generalize to unseen data. Recent advances in regularization techniques, such as pretraining and dropout, indicate that modeling input data distribution (either explicitly or implicitly) greatly improves the generalization ability of a DNN. In this work, we explore the manifold hypothesis which assumes that instances within the same class lie in a smooth manifold. We accordingly propose two simple regularizers to a standard discriminative DNN. The first one, named Label-Aware Manifold Regularization, assumes the availability of labels and penalizes large norms of the loss function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
