Lipschitz constant estimation for general neural network architectures   using control tools

Patricia Pauli; Dennis Gramlich; Frank Allg\"ower

arXiv:2405.01125·cs.LG·November 26, 2024

Lipschitz constant estimation for general neural network architectures using control tools

Patricia Pauli, Dennis Gramlich, Frank Allg\"ower

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for estimating the Lipschitz constant of various neural network architectures by modeling them as dynamical systems and using control theory tools, improving scalability and generality.

Contribution

It proposes a new approach that interprets neural networks as dynamical systems and employs semidefinite programming with integral quadratic constraints for Lipschitz estimation.

Findings

01

Effective estimation of Lipschitz constants for diverse architectures.

02

Demonstrated scalability on large neural networks.

03

Applied to MNIST and CIFAR-10 datasets with promising results.

Abstract

This paper is devoted to the estimation of the Lipschitz constant of general neural network architectures using semidefinite programming. For this purpose, we interpret neural networks as time-varying dynamical systems, where the $k$ -th layer corresponds to the dynamics at time $k$ . A key novelty with respect to prior work is that we use this interpretation to exploit the series interconnection structure of feedforward neural networks with a dynamic programming recursion. Nonlinearities, such as activation functions and nonlinear pooling layers, are handled with integral quadratic constraints. If the neural network contains signal processing layers (convolutional or state space model layers), we realize them as 1-D/2-D/N-D systems and exploit this structure as well. We distinguish ourselves from related work on Lipschitz constant estimation by more extensive structure exploitation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ppauli/GLipSDP
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications