Deep learning and the rate of approximation by flows
Jingpu Cheng, Qianxiao Li, Ting Lin, Zuowei Shen

TL;DR
This paper explores how the depth of deep residual networks influences their approximation capabilities, framing the problem in a continuous dynamical systems context and linking it to geometric properties of diffeomorphisms.
Contribution
It introduces a geometric framework using sub-Finsler manifolds to quantify the minimal time for approximation, connecting learning efficiency to architectural compatibility.
Findings
Approximation capacity relates to geodesic distances on diffeomorphism manifolds.
Deep learning approximation differs fundamentally from linear approximation theory.
The minimal time for approximation is characterized by a variational principle.
Abstract
We investigate the dependence of the approximation capacity of deep residual networks on its depth in a continuous dynamical systems setting. This can be formulated as the general problem of quantifying the minimal time-horizon required to approximate a diffeomorphism by flows driven by a given family of vector fields. We show that this minimal time can be identified as a geodesic distance on a sub-Finsler manifold of diffeomorphisms, where the local geometry is characterised by a variational principle involving . This connects the learning efficiency of target relationships to their compatibility with the learning architectural choice. Further, the results suggest that the key approximation mechanism in deep learning, namely the approximation of functions by composition or dynamics, differs in a fundamental way from linear approximation theory, where linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Statistical Mechanics and Entropy
