Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling

Haley Rosso; Lars Ruthotto; Khachik Sargsyan

arXiv:2507.22045·cs.LG·July 30, 2025

Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling

Haley Rosso, Lars Ruthotto, Khachik Sargsyan

PDF

TL;DR

This paper explores polynomial basis-based weight parameterizations in continuous-time neural networks, demonstrating improved training stability and efficiency while maintaining high accuracy in surrogate modeling of physical systems.

Contribution

It introduces a low-dimensional polynomial basis approach for weight parameterization in neural ODEs and ResNets, enhancing training stability and computational efficiency.

Findings

01

Legendre basis improves training stability

02

Reduces computational cost compared to unconstrained models

03

Achieves accuracy comparable or better than existing methods

Abstract

Continuous-time deep learning models, such as neural ordinary differential equations (ODEs), offer a promising framework for surrogate modeling of complex physical systems. A central challenge in training these models lies in learning expressive yet stable time-varying weights, particularly under computational constraints. This work investigates weight parameterization strategies that constrain the temporal evolution of weights to a low-dimensional subspace spanned by polynomial basis functions. We evaluate both monomial and Legendre polynomial bases within neural ODE and residual network (ResNet) architectures under discretize-then-optimize and optimize-then-discretize training paradigms. Experimental results across three high-dimensional benchmark problems show that Legendre parameterizations yield more stable training dynamics, reduce computational cost, and achieve accuracy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.