Introduction to optimization methods for training SciML models
Alena Kopani\v{c}\'akov\'a, Elisa Riccietti

TL;DR
This paper provides a unified overview of optimization methods tailored for training scientific machine learning models, highlighting differences from classical ML and discussing suitable algorithms for physics-informed problems.
Contribution
It offers a comprehensive review of optimization techniques adapted for SciML, emphasizing problem structure and practical strategies for physics-constrained models.
Findings
Standard stochastic methods often limited in SciML due to problem structure
Deterministic and curvature-aware methods show promise for SciML optimization
Tutorial examples illustrate practical application of discussed optimization strategies
Abstract
Optimization is central to both modern machine learning (ML) and scientific machine learning (SciML), yet the structure of the underlying optimization problems differs substantially across these domains. Classical ML typically relies on stochastic, sample-separable objectives that favor first-order and adaptive gradient methods. In contrast, SciML often involves physics-informed or operator-constrained formulations in which differential operators induce global coupling, stiffness, and strong anisotropy in the loss landscape. As a result, optimization behavior in SciML is governed by the spectral properties of the underlying physical models rather than by data statistics, frequently limiting the effectiveness of standard stochastic methods and motivating deterministic or curvature-aware approaches. This document provides a unified introduction to optimization methods in ML and SciML,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning in Materials Science · Stochastic Gradient Optimization Techniques
