Robust and Provably Monotonic Networks
Ouail Kitouni, Niklas Nolte, Mike Williams

TL;DR
This paper introduces a simple weight normalization method to constrain neural network Lipschitz constants, enabling robust, monotonic, and interpretable models with high expressiveness, applicable across various domains including particle physics and healthcare.
Contribution
The authors propose a minimally constraining normalization scheme that controls Lipschitz constants and facilitates monotonicity, enhancing robustness and interpretability of neural networks.
Findings
Successfully applied to classify subatomic particle decays at CERN.
Achieved state-of-the-art results in medicine and finance benchmarks.
Implemented as the primary data-selection algorithm in LHCb Run 3.
Abstract
The Lipschitz constant of the map between the input and output space represented by a neural network is a natural metric for assessing the robustness of the model. We present a new method to constrain the Lipschitz constant of dense deep learning models that can also be generalized to other architectures. The method relies on a simple weight normalization scheme during training that ensures the Lipschitz constant of every layer is below an upper limit specified by the analyst. A simple monotonic residual connection can then be used to make the model monotonic in any subset of its inputs, which is useful in scenarios where domain knowledge dictates such dependence. Examples can be found in algorithmic fairness requirements or, as presented here, in the classification of the decays of subatomic particles produced at the CERN Large Hadron Collider. Our normalization is minimally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Nuclear reactor physics and engineering · Adversarial Robustness in Machine Learning
MethodsWeight Normalization · Residual Connection
