Convergence and Sketching-Based Efficient Computation of Neural Tangent Kernel Weights in Physics-Based Loss
Max Hirsch, Federico Pichi

TL;DR
This paper proves convergence of adaptive NTK-based weights in neural network training, and introduces a randomized sketching algorithm to efficiently compute these weights, supported by numerical experiments.
Contribution
It provides the first convergence analysis for NTK-based adaptive weights and develops a novel randomized algorithm for efficient NTK computation in neural networks.
Findings
Proved convergence of gradient descent with adaptive NTK weights.
Developed a randomized predictor-corrector algorithm for NTK estimation.
Numerical experiments demonstrate the efficiency and accuracy of the proposed method.
Abstract
In multi-objective optimization, multiple loss terms are weighted and added together to form a single objective. These weights are chosen to properly balance the competing losses according to some meta-goal. For example, in physics-informed neural networks (PINNs), these weights are often adaptively chosen to improve the network's generalization error. A popular choice of adaptive weights is based on the neural tangent kernel (NTK) of the PINN, which describes the evolution of the network in predictor space during training. The convergence of such an adaptive weighting algorithm is not clear a priori. Moreover, these NTK-based weights would be updated frequently during training, further increasing the computational burden of the learning process. In this paper, we prove that under appropriate conditions, gradient descent enhanced with adaptive NTK-based weights is convergent in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Machine Learning and ELM
