PDE+: Enhancing Generalization via PDE with Adaptive Distributional   Diffusion

Yige Yuan; Bingbing Xu; Bo Lin; Liang Hou; Fei Sun; Huawei Shen; Xueqi; Cheng

arXiv:2305.15835·cs.LG·December 18, 2023·1 cites

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Yige Yuan, Bingbing Xu, Bo Lin, Liang Hou, Fei Sun, Huawei Shen, Xueqi, Cheng

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces PDE+ which leverages adaptive distributional diffusion within a PDE framework to enhance neural network generalization by smoothing solutions and covering unobserved data distributions.

Contribution

It establishes a PDE-based theoretical framework linking generalization to solution smoothness and proposes PDE+ to improve generalization through adaptive diffusion.

Findings

01

PDE+ outperforms state-of-the-art methods in various experiments.

02

The method improves model robustness to distribution shifts.

03

Adaptive diffusion enhances the smoothness of neural network solutions.

Abstract

The generalization of neural networks is a central challenge in machine learning, especially concerning the performance under distributions that differ from training ones. Current methods, mainly based on the data-driven paradigm such as data augmentation, adversarial training, and noise injection, may encounter limited generalization due to model non-smoothness. In this paper, we propose to investigate generalization from a Partial Differential Equation (PDE) perspective, aiming to enhance it directly through the underlying function of neural networks, rather than focusing on adjusting input data. Specifically, we first establish the connection between neural network generalization and the smoothness of the solution to a specific PDE, namely "transport equation". Building upon this, we propose a general framework that introduces adaptive distributional diffusion into transport equation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuanyige/pde-add
pytorchOfficial

Videos

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion· underline

Taxonomy

TopicsModel Reduction and Neural Networks · Machine Learning in Healthcare

MethodsDiffusion