Multi-Level Fine-Tuning: Closing Generalization Gaps in Approximation of   Solution Maps under a Limited Budget for Training Data

Zhihan Li; Yuwei Fan; Lexing Ying

arXiv:2102.07169·math.NA·February 16, 2021

Multi-Level Fine-Tuning: Closing Generalization Gaps in Approximation of Solution Maps under a Limited Budget for Training Data

Zhihan Li, Yuwei Fan, Lexing Ying

PDF

Open Access

TL;DR

This paper introduces a multi-level fine-tuning approach for regression networks to improve generalization in approximating solution maps, especially when training data is limited and costly to generate.

Contribution

The paper proposes a novel multi-level fine-tuning algorithm inspired by multi-level Monte Carlo, optimizing training sample distribution across levels to reduce generalization error under data budget constraints.

Findings

01

Significant reduction in generalization error demonstrated in numerical experiments.

02

The multi-level approach effectively utilizes coarse and fine grid samples.

03

Theoretical analysis provides estimators for generalization error and guides data allocation.

Abstract

In scientific machine learning, regression networks have been recently applied to approximate solution maps (e.g., potential-ground state map of Schr\"odinger equation). In this paper, we aim to reduce the generalization error without spending more time in generating training samples. However, to reduce the generalization error, the regression network needs to be fit on a large number of training samples (e.g., a collection of potential-ground state pairs). The training samples can be produced by running numerical solvers, which takes much time in many applications. In this paper, we aim to reduce the generalization error without spending more time in generating training samples. Inspired by few-shot learning techniques, we develop the Multi-Level Fine-Tuning algorithm by introducing levels of training: we first train the regression network on samples generated at the coarsest grid and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Neural Networks and Applications