Nuclear Norm Regularization for Deep Learning
Christopher Scarvelis, Justin Solomon

TL;DR
This paper introduces an efficient method for applying nuclear norm regularization to deep learning models, encouraging low-rank local behavior of functions, and demonstrates its effectiveness in denoising and representation learning tasks.
Contribution
It proposes a scalable approach to penalize the Jacobian nuclear norm in deep networks, including a novel approximation that avoids direct Jacobian computations.
Findings
Method scales to high-dimensional problems
Improves denoising performance
Enhances representation learning
Abstract
Penalizing the nuclear norm of a function's Jacobian encourages it to locally behave like a low-rank linear map. Such functions vary locally along only a handful of directions, making the Jacobian nuclear norm a natural regularizer for machine learning problems. However, this regularizer is intractable for high-dimensional problems, as it requires computing a large Jacobian matrix and taking its singular value decomposition. We show how to efficiently penalize the Jacobian nuclear norm using techniques tailor-made for deep learning. We prove that for functions parametrized as compositions , one may equivalently penalize the average squared Frobenius norm of and . We then propose a denoising-style approximation that avoids the Jacobian computations altogether. Our method is simple, efficient, and accurate, enabling Jacobian nuclear norm regularization to scale to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications
