Deep Neural Networks Motivated by Partial Differential Equations

Lars Ruthotto; Eldad Haber

arXiv:1804.04272·cs.LG·December 12, 2018

Deep Neural Networks Motivated by Partial Differential Equations

Lars Ruthotto, Eldad Haber

PDF

1 Repo

TL;DR

This paper introduces a PDE-inspired framework for understanding and designing deep convolutional neural networks, specifically ResNets, leading to new architectures with competitive performance and better theoretical insight.

Contribution

It establishes a PDE-based interpretation of ResNets and derives three novel architectures within parabolic and hyperbolic classes, enhancing understanding and performance.

Findings

01

New PDE-inspired ResNet architectures outperform existing models in experiments.

02

PDE theory offers valuable insights into deep learning design and analysis.

03

The proposed models demonstrate competitive accuracy on benchmark tasks.

Abstract

Partial differential equations (PDEs) are indispensable for modeling many physical phenomena and also commonly used for solving image processing tasks. In the latter area, PDE-based approaches interpret image data as discretizations of multivariate functions and the output of image processing algorithms as solutions to certain PDEs. Posing image processing problems in the infinite dimensional setting provides powerful tools for their analysis and solution. Over the last few decades, the reinterpretation of classical image processing problems through the PDE lens has been creating multiple celebrated approaches that benefit a vast area of tasks including image segmentation, denoising, registration, and reconstruction. In this paper, we establish a new PDE-interpretation of a class of deep convolutional neural networks (CNN) that are commonly used to learn from speech, image, and video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EmoryMLIP/DynamicBlocks
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution