Learning Deep Hybrid Models with Sharpness-Aware Minimization

Naoya Takeishi

arXiv:2602.06837 · cs.LG · February 9, 2026

TL;DR

This paper applies sharpness-aware minimization (SAM) to the learning of deep hybrid models, favoring flat loss minima so that the learned model stays as simple as possible. Numerical experiments across different choices of models and datasets show that the method works well.

Contribution

It adapts sharpness-aware minimization to the hybrid modeling setting, as an alternative to regularizers whose formulation depends strongly on model architecture and domain knowledge.

Findings

01

SAM-based method improves hybrid model robustness

02

Effective across different models and datasets

03

Promotes simpler, more stable solutions

Abstract

Hybrid modeling, the combination of machine learning models and scientific mathematical models, enables flexible and robust data-driven prediction with partial interpretability. However, the scientific models may effectively be ignored in prediction because of the flexibility of the machine learning model, which makes the idea of hybrid modeling pointless. Typically, some regularization is applied to hybrid model learning to avoid such a failure case, but the formulation of the regularizer strongly depends on model architectures and domain knowledge. In this paper, we propose to focus on the flatness of loss minima in learning hybrid models, aiming to make the model as simple as possible. We employ the idea of sharpness-aware minimization (SAM) and adapt it to the hybrid modeling setting. Numerical experiments show that the SAM-based method works well across different choices of models and datasets.
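
To make the idea of sharpness-aware minimization concrete, the following is a minimal sketch of the generic SAM update applied to a toy hybrid model. It is not the paper's adaptation to hybrid modeling, which the abstract does not spell out. The model form (a scientific term a*sin(x) plus a linear data-driven correction), the synthetic data, and the names predict, grad, sam_perturbation, and sam_step are all assumptions made for illustration. Each step first moves the parameters by a perturbation of norm rho along the gradient direction, then descends using the gradient evaluated at that perturbed point.

```python
import math

def predict(params, x):
    # Hypothetical hybrid model (an assumption for illustration):
    # scientific term a*sin(x) plus a flexible correction w0 + w1*x.
    a, w0, w1 = params
    return a * math.sin(x) + w0 + w1 * x

def loss(params, data):
    # Mean squared error over (x, y) pairs.
    return sum((predict(params, x) - y) ** 2 for x, y in data) / len(data)

def grad(params, data):
    # Analytic gradient of the mean squared error w.r.t. (a, w0, w1).
    g = [0.0, 0.0, 0.0]
    for x, y in data:
        r = 2.0 * (predict(params, x) - y) / len(data)
        g[0] += r * math.sin(x)
        g[1] += r
        g[2] += r * x
    return g

def sam_perturbation(params, data, rho):
    # First-order approximation of the worst-case perturbation inside
    # an L2 ball of radius rho: rho * g / ||g||.
    g = grad(params, data)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        return [0.0] * len(params)
    return [rho * gi / norm for gi in g]

def sam_step(params, data, lr=0.05, rho=0.05):
    # Generic SAM step: evaluate the gradient at the perturbed point,
    # then apply it to the original (unperturbed) parameters.
    eps = sam_perturbation(params, data, rho)
    shifted = [p + e for p, e in zip(params, eps)]
    g_sharp = grad(shifted, data)
    return [p - lr * g for p, g in zip(params, g_sharp)]

# Synthetic data generated by the scientific term alone (true a = 2).
data = [(0.1 * i, 2.0 * math.sin(0.1 * i)) for i in range(50)]
params = [0.0, 0.0, 0.0]
initial_loss = loss(params, data)
for _ in range(200):
    params = sam_step(params, data)
final_loss = loss(params, data)
print(initial_loss, final_loss)
```

In this sketch rho sets the radius of the neighborhood over which sharpness is probed; setting rho to zero recovers plain gradient descent. Which parameters to perturb in a hybrid model (the scientific ones, the machine learning ones, or both) is a design choice the sketch does not address.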

Taxonomy

Topics: Model Reduction and Neural Networks · Explainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis