Nuclear Norm Regularization for Deep Learning

Christopher Scarvelis; Justin Solomon

arXiv:2405.14544 · cs.LG · October 11, 2024


TL;DR

This paper introduces an efficient method for applying nuclear norm regularization to deep learning models, encouraging low-rank local behavior of functions, and demonstrates its effectiveness in denoising and representation learning tasks.

Contribution

It proposes a scalable approach to penalize the Jacobian nuclear norm in deep networks, including a novel approximation that avoids direct Jacobian computations.
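
To make the idea of avoiding direct Jacobian computations concrete, here is a minimal sketch of a generic perturbation-based estimate of a squared Jacobian Frobenius norm. It relies on the standard identity $\mathbb{E}_\epsilon \|J\epsilon\|^2 = \|J\|_F^2$ for $\epsilon \sim \mathcal{N}(0, I)$, and replaces $J\epsilon$ with a finite difference of function values. The toy map `f`, the weights `W`, the noise scale `sigma`, and the sample count are all illustrative assumptions; this is not claimed to be the paper's exact estimator.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy network: one tanh layer mapping R^8 -> R^5.
W = rng.standard_normal((5, 8)) / np.sqrt(8)

def f(x):
    """Works on a single input of shape (8,) or a batch of shape (n, 8)."""
    return np.tanh(x @ W.T)

x = rng.standard_normal(8)

# Reference value: build the Jacobian analytically (only feasible for toy sizes).
J = (1.0 - np.tanh(W @ x) ** 2)[:, None] * W
exact = np.sum(J ** 2)  # squared Frobenius norm of J

# Jacobian-free estimate: perturb the input with small Gaussian noise and
# average the squared, rescaled change in the output.
sigma = 1e-3                              # assumed noise scale
eps = rng.standard_normal((20000, 8))     # assumed number of noise samples
diffs = (f(x + sigma * eps) - f(x)) / sigma
estimate = np.mean(np.sum(diffs ** 2, axis=1))

print(f"exact    = {exact:.4f}")
print(f"estimate = {estimate:.4f}")
```

The estimate only needs forward evaluations of `f`, so its cost does not depend on forming or decomposing a Jacobian matrix. Its accuracy depends on the noise scale and the number of samples.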

Findings

01. Method scales to high-dimensional problems
02. Improves denoising performance
03. Enhances representation learning

Abstract

Penalizing the nuclear norm of a function's Jacobian encourages it to locally behave like a low-rank linear map. Such functions vary locally along only a handful of directions, making the Jacobian nuclear norm a natural regularizer for machine learning problems. However, this regularizer is intractable for high-dimensional problems, as it requires computing a large Jacobian matrix and taking its singular value decomposition. We show how to efficiently penalize the Jacobian nuclear norm using techniques tailor-made for deep learning. We prove that for functions parametrized as compositions $f = g \circ h$, one may equivalently penalize the average squared Frobenius norm of $Jg$ and $Jh$. We then propose a denoising-style approximation that avoids the Jacobian computations altogether. Our method is simple, efficient, and accurate, enabling Jacobian nuclear norm regularization to scale to…
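
The composition result has a classical matrix analogue that can be checked numerically: for any matrix $A$, the nuclear norm equals the minimum of $\tfrac{1}{2}(\|G\|_F^2 + \|H\|_F^2)$ over all factorizations $A = GH$. The sketch below verifies this for a small random matrix. The matrix sizes, the rank, and the helper `factor_penalty` are illustrative assumptions, and the example covers only the linear case, not the paper's theorem for general compositions.

```python
import numpy as np

rng = np.random.default_rng(0)

# A hypothetical rank-2 "Jacobian": 6 outputs, 8 inputs.
A = rng.standard_normal((6, 2)) @ rng.standard_normal((2, 8))

# Direct route: nuclear norm = sum of singular values (requires an SVD).
U, s, Vt = np.linalg.svd(A, full_matrices=False)
nuclear_norm = s.sum()

def factor_penalty(G, H):
    """Average of the squared Frobenius norms of the two factors."""
    return 0.5 * (np.linalg.norm(G, "fro") ** 2 + np.linalg.norm(H, "fro") ** 2)

# The balanced factorization A = G_opt @ H_opt attains the nuclear norm.
G_opt = U * np.sqrt(s)             # scale column i of U by sqrt(s_i)
H_opt = np.sqrt(s)[:, None] * Vt   # scale row i of Vt by sqrt(s_i)
balanced = factor_penalty(G_opt, H_opt)

# Any other factorization of the same matrix gives a penalty at least as large.
M = rng.standard_normal((6, 6)) + 3.0 * np.eye(6)  # arbitrary mixing matrix, assumed invertible
G_alt = G_opt @ M
H_alt = np.linalg.solve(M, H_opt)  # equals inv(M) @ H_opt
unbalanced = factor_penalty(G_alt, H_alt)

print(f"nuclear norm        = {nuclear_norm:.4f}")
print(f"balanced penalty    = {balanced:.4f}")
print(f"unbalanced penalty  = {unbalanced:.4f}")
```

Because the minimum over factorizations recovers the nuclear norm, a training procedure that drives the factor penalty down does not need to compute an SVD. In the linear case, this is the appeal of penalizing Frobenius norms of the factors instead.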


Videos

Nuclear Norm Regularization for Deep Learning · SlidesLive

Taxonomy

Topics: Neural Networks and Applications