Improving the Efficiency of Gradient Descent Algorithms Applied to Optimization Problems with Dynamical Constraints
Ion Matei, Maksym Zhenirovskyy, Johan de Kleer, John Maxwell

TL;DR
This paper presents two novel block coordinate descent algorithms for optimization problems with ODE constraints that avoid sensitivity analysis, achieving significant speedups and improved accuracy over traditional methods, especially on large-scale problems.
Contribution
The paper introduces two new algorithms that bypass the need for sensitivity analysis in ODE-constrained optimization, enhancing efficiency and scalability.
Findings
At least 4x faster than Pytorch implementations.
At least 16x faster than Jax implementations.
More accurate results on training and test data.
Abstract
We introduce two block coordinate descent algorithms for solving optimization problems with ordinary differential equations (ODEs) as dynamical constraints. The algorithms do not need to implement direct or adjoint sensitivity analysis methods to evaluate loss function gradients. They results from reformulation of the original problem as an equivalent optimization problem with equality constraints. The algorithms naturally follow from steps aimed at recovering the gradient-decent algorithm based on ODE solvers that explicitly account for sensitivity of the ODE solution. In our first proposed algorithm we avoid explicitly solving the ODE by integrating the ODE solver as a sequence of implicit constraints. In our second algorithm, we use an ODE solver to reset the ODE solution, but no direct are adjoint sensitivity analysis methods are used. Both algorithm accepts mini-batch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
MethodsTest
