Precise asymptotic analysis of Sobolev training for random feature models
Katharine E Fisher, Matthew TC Li, Youssef Marzouk, Timo Schorlepp

TL;DR
This paper provides a detailed theoretical analysis of Sobolev training for random feature models, revealing how gradient information influences generalization in high-dimensional, overparameterized settings.
Contribution
It offers the first precise asymptotic characterization of Sobolev training effects on random feature models using advanced statistical physics methods.
Findings
Supplementing data with gradients does not always improve performance.
Overparameterization level influences the effectiveness of Sobolev training.
Optimal performance occurs when models interpolate noisy function and gradient data.
Abstract
Gradient information is widely useful and available in applications, and is therefore natural to include in the training of neural networks. Yet little is known theoretically about the impact of Sobolev training -- regression with both function and gradient data -- on the generalization error of highly overparameterized predictive models in high dimensions. In this paper, we obtain a precise characterization of this training modality for random feature (RF) models in the limit where the number of trainable parameters, input dimensions, and training data tend proportionally to infinity. Our model for Sobolev training reflects practical implementations by sketching gradient data onto finite dimensional subspaces. By combining the replica method from statistical physics with linearizations in operator-valued free probability theory, we derive a closed-form description for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods
