An axiomatized PDE model of deep neural networks

Tangjun Wang; Wenqi Tao; Chenglong Bao; Zuoqiang Shi

arXiv:2307.12333·cs.LG·March 25, 2024

An axiomatized PDE model of deep neural networks

Tangjun Wang, Wenqi Tao, Chenglong Bao, Zuoqiang Shi

PDF

Open Access

TL;DR

This paper establishes a PDE-based framework for deep neural networks, specifically modeling them as convection-diffusion equations, which enhances understanding, robustness, and training methods for ResNets.

Contribution

It introduces a PDE model for DNNs as convection-diffusion equations, providing new theoretical insights and a novel training approach for ResNets.

Findings

01

The convection-diffusion PDE model explains effective network behaviors.

02

The model improves neural network robustness.

03

Experimental results validate the new training method.

Abstract

Inspired by the relation between deep neural network (DNN) and partial differential equations (PDEs), we study the general form of the PDE models of deep neural networks. To achieve this goal, we formulate DNN as an evolution operator from a simple base model. Based on several reasonable assumptions, we prove that the evolution operator is actually determined by convection-diffusion equation. This convection-diffusion equation model gives mathematical explanation for several effective networks. Moreover, we show that the convection-diffusion model improves the robustness and reduces the Rademacher complexity. Based on the convection-diffusion equation, we design a new training method for ResNets. Experiments validate the performance of the proposed method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications

MethodsBalanced Selection