Resolving gradient pathology in physics-informed epidemiological models
Nickson Golooba, Woldegebriel Assefa Woldegerima

TL;DR
This paper introduces conflict-gated gradient scaling (CGGS), a novel method to stabilize and accelerate physics-informed neural network training in epidemiology by dynamically managing gradient conflicts, leading to better convergence and parameter estimation.
Contribution
We propose CGGS, a new gradient conflict resolution technique that preserves convergence guarantees and improves training stability and efficiency in physics-informed epidemiological models.
Findings
CGGS improves convergence speed over baseline methods.
It enhances parameter estimation accuracy in stiff epidemiological systems.
The method induces a curriculum learning effect, aiding training.
Abstract
Physics-informed neural networks (PINNs) are increasingly used in mathematical epidemiology to bridge the gap between noisy clinical data and compartmental models, such as the susceptible-exposed-infected-removed (SEIR) model. However, training these hybrid networks is often unstable due to competing optimization objectives. As established in recent literature on ``gradient pathology," the gradient vectors derived from the data loss and the physical residual often point in conflicting directions, leading to slow convergence or optimization deadlock. While existing methods attempt to resolve this by balancing gradient magnitudes or projecting conflicting vectors, we propose a novel method, conflict-gated gradient scaling (CGGS), to address gradient conflicts in physics-informed neural networks for epidemiological modelling, ensuring stable and efficient training and a computationally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Reservoir Computing · Generative Adversarial Networks and Image Synthesis
