(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks   Through Differentiable Regularization of the Condition Number

Rossen Nenov; Daniel Haider; Peter Balazs

arXiv:2410.00169·cs.LG·October 2, 2024

(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks Through Differentiable Regularization of the Condition Number

Rossen Nenov, Daniel Haider, Peter Balazs

PDF

Open Access 1 Repo

TL;DR

This paper proposes a differentiable regularizer based on the condition number to improve numerical stability in neural networks, enabling gradient-based optimization and demonstrating benefits in noisy classification and image denoising tasks.

Contribution

It introduces a novel, almost everywhere differentiable regularizer for the condition number, facilitating stable training of neural networks.

Findings

01

Improved stability in noisy classification tasks.

02

Enhanced denoising performance on MNIST images.

03

Regularizer is easy to implement and integrate.

Abstract

Maintaining numerical stability in machine learning models is crucial for their reliability and performance. One approach to maintain stability of a network layer is to integrate the condition number of the weight matrix as a regularizing term into the optimization algorithm. However, due to its discontinuous nature and lack of differentiability the condition number is not suitable for a gradient descent approach. This paper introduces a novel regularizer that is provably differentiable almost everywhere and promotes matrices with low condition numbers. In particular, we derive a formula for the gradient of this regularizer which can be easily implemented and integrated into existing optimization algorithms. We show the advantages of this approach for noisy classification and denoising of MNIST images.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

danedane-haider/Almost-Smooth-Sailing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications