Wasserstein Distributionally Robust Shallow Convex Neural Networks

Julien Pallage; Antoine Lesage-Landry

arXiv:2407.16800·cs.LG·August 22, 2025

Wasserstein Distributionally Robust Shallow Convex Neural Networks

Julien Pallage, Antoine Lesage-Landry

PDF

1 Repo

TL;DR

This paper introduces WaDiRo-SCNNs, a convex neural network training method based on Wasserstein distributional robustness, providing reliable predictions under data corruption with guarantees and applicability to critical sectors.

Contribution

The paper develops a novel convex training framework for shallow ReLU neural networks using Wasserstein distributionally robust optimization, enabling scalable, safe, and verifiable models.

Findings

01

Demonstrates improved out-of-sample performance on synthetic and real-world data.

02

Enforces physical constraints within the training process.

03

Provides stability verification through a mixed-integer convex program.

Abstract

In this work, we propose Wasserstein distributionally robust shallow convex neural networks (WaDiRo-SCNNs) to provide reliable nonlinear predictions when subject to adverse and corrupted datasets. Our approach is based on the reformulation of a new convex training program for ReLU-based shallow neural networks, which allows us to cast the problem into the order-1 Wasserstein distributionally robust optimization framework. Our training procedure is conservative, has low stochasticity, is solvable with open-source solvers, and is scalable to large industrial deployments. We provide out-of-sample performance guarantees, show that hard convex physical constraints can be enforced in the training program, and propose a mixed-integer convex post-training verification program to evaluate model stability. WaDiRo-SCNN aims to make neural networks safer for critical applications, such as in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jupall/WaDiRo-SCNN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.