TL;DR
This paper introduces a PDE-inspired framework for understanding and designing deep convolutional neural networks, specifically ResNets, leading to new architectures with competitive performance and better theoretical insight.
Contribution
It establishes a PDE-based interpretation of ResNets and derives three novel architectures within parabolic and hyperbolic classes, enhancing understanding and performance.
Findings
New PDE-inspired ResNet architectures outperform existing models in experiments.
PDE theory offers valuable insights into deep learning design and analysis.
The proposed models demonstrate competitive accuracy on benchmark tasks.
Abstract
Partial differential equations (PDEs) are indispensable for modeling many physical phenomena and also commonly used for solving image processing tasks. In the latter area, PDE-based approaches interpret image data as discretizations of multivariate functions and the output of image processing algorithms as solutions to certain PDEs. Posing image processing problems in the infinite dimensional setting provides powerful tools for their analysis and solution. Over the last few decades, the reinterpretation of classical image processing problems through the PDE lens has been creating multiple celebrated approaches that benefit a vast area of tasks including image segmentation, denoising, registration, and reconstruction. In this paper, we establish a new PDE-interpretation of a class of deep convolutional neural networks (CNN) that are commonly used to learn from speech, image, and video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
