An output scaling layer boosts deep neural networks for multiscale ODE systems
Yuxiao Yi, Weizong Wang, Tianhan Zhang, Zhi-Qin John Xu

TL;DR
This paper introduces the Generalized Box-Cox Transformation (GBCT), a nonlinear data scaling method that significantly improves deep neural network performance on multiscale ODE systems by reducing prediction errors and aligning data frequencies.
Contribution
The paper proposes GBCT, a novel nonlinear scaling technique that enhances neural network modeling of multiscale ODE systems, addressing frequency bias and improving accuracy.
Findings
GBCT reduces prediction errors by up to two orders of magnitude.
GBCT achieves similar performance with only one-sixth of training epochs.
GBCT rescales high-frequency components to lower frequencies, improving training and generalization.
Abstract
Simulating complex diffusion-reaction systems is often prohibitively expensive due to the high dimensionality and stiffness of the underlying ODEs, where state variables may span tens of orders of magnitude. Deep learning has recently emerged as a powerful tool in scientific computing, achieving remarkable progress in modeling and sampling stiff systems. However, data scaling techniques remain largely underexplored, despite their crucial role in addressing the frequency bias of deep neural networks when handling multi-magnitude or high-frequency data. In this work, we propose the Generalized Box-Cox Transformation (GBCT), a novel nonlinear scaling method designed to mitigate multiscale challenges by rescaling inherent multi-magnitude components toward a more consistent order of magnitude. We integrate GBCT into our previous data-driven framework and evaluate its performance against the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning in Materials Science · Quantum chaos and dynamical systems
