On the regularized risk of distributionally robust learning over deep   neural networks

Camilo Garcia Trillos; Nicolas Garcia Trillos

arXiv:2109.06294·math.OC·March 29, 2022

On the regularized risk of distributionally robust learning over deep neural networks

Camilo Garcia Trillos, Nicolas Garcia Trillos

PDF

Open Access

TL;DR

This paper investigates the connection between distributionally robust learning and regularization in deep neural networks, deriving new approximations and scalable algorithms for robust training using optimal transport and control theory.

Contribution

It introduces novel regularization approximations for distributionally robust problems and develops scalable algorithms based on optimal control principles for training robust neural networks.

Findings

01

Derives first and second order regularized risk approximations.

02

Identifies mean-field optimal control structure in ResNet models.

03

Proposes scalable algorithms motivated by Pontryagin maximum principles.

Abstract

In this paper we explore the relation between distributionally robust learning and different forms of regularization to enforce robustness of deep neural networks. In particular, starting from a concrete min-max distributionally robust problem, and using tools from optimal transport theory, we derive first order and second order approximations to the distributionally robust problem in terms of appropriate regularized risk minimization problems. In the context of deep ResNet models, we identify the structure of the resulting regularization problems as mean-field optimal control problems where the number and dimension of state variables is within a dimension-free factor of the dimension of the original unrobust problem. Using the Pontryagin maximum principles associated to these problems we motivate a family of scalable algorithms for the training of robust neural networks. Our analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Batch Normalization · 1x1 Convolution · Residual Block · Convolution · Max Pooling · Bottleneck Residual Block · Average Pooling · Global Average Pooling