Expansive Natural Neural Gradient Flows for Energy Minimization
Wolfgang Dahmen, Wuchen Li, Yuankai Teng, Zhu Wang

TL;DR
This paper introduces expansive gradient dynamics in neural network spaces for energy minimization, proposing an adaptive expansion strategy to improve alignment with Hilbert space gradients, with applications in supervised learning and inverse problems.
Contribution
It develops a novel neural network expansion method that enhances gradient flow alignment in energy minimization tasks within a Hilbert space framework.
Findings
Effective for simple model problems in supervised learning and inverse problems
Highlights importance of neural flow matrix assembly based on Hilbert space inner product
Demonstrates the potential of adaptive neural network expansion in gradient-based energy minimization
Abstract
This paper develops expansive gradient dynamics in deep neural network-induced mapping spaces. Specifically, we generate tools and concepts for minimizing a class of energy functionals in an abstract Hilbert space setting covering a wide scope of applications such as PDEs-based inverse problems and supervised learning. The approach hinges on a Hilbert space metric in the full diffeomorphism mapping space, which could be viewed as a generalized Wasserstein-2 metric. We then study a projection gradient descent method within deep neural network parameterized sets. More importantly, we develop an adaptation and expanding strategy to step-by-step enlarge the deep neural network structures. In particular, the expansion mechanism aims to enhance the alignment of the neural manifold induced natural gradient direction as well as possible with the ideal Hilbert space gradient descent direction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
