Multi-Level Fine-Tuning: Closing Generalization Gaps in Approximation of Solution Maps under a Limited Budget for Training Data
Zhihan Li, Yuwei Fan, Lexing Ying

TL;DR
This paper introduces a multi-level fine-tuning approach for regression networks to improve generalization in approximating solution maps, especially when training data is limited and costly to generate.
Contribution
The paper proposes a novel multi-level fine-tuning algorithm inspired by multi-level Monte Carlo, optimizing training sample distribution across levels to reduce generalization error under data budget constraints.
Findings
Significant reduction in generalization error demonstrated in numerical experiments.
The multi-level approach effectively utilizes coarse and fine grid samples.
Theoretical analysis provides estimators for generalization error and guides data allocation.
Abstract
In scientific machine learning, regression networks have been recently applied to approximate solution maps (e.g., potential-ground state map of Schr\"odinger equation). In this paper, we aim to reduce the generalization error without spending more time in generating training samples. However, to reduce the generalization error, the regression network needs to be fit on a large number of training samples (e.g., a collection of potential-ground state pairs). The training samples can be produced by running numerical solvers, which takes much time in many applications. In this paper, we aim to reduce the generalization error without spending more time in generating training samples. Inspired by few-shot learning techniques, we develop the Multi-Level Fine-Tuning algorithm by introducing levels of training: we first train the regression network on samples generated at the coarsest grid and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
