Symplectic Methods in Deep Learning

Sofya Maslovskaya; Sina Ober-Bl\"obaum

arXiv:2406.04104·math.NA·June 7, 2024

Symplectic Methods in Deep Learning

Sofya Maslovskaya, Sina Ober-Bl\"obaum

PDF

Open Access

TL;DR

This paper introduces symplectic neural networks based on higher order explicit methods, combining theoretical stability guarantees with practical efficiency in modeling dynamical systems.

Contribution

It develops symplectic networks using higher order explicit methods that maintain non-vanishing gradients, enhancing stability and efficiency in learning dynamical systems.

Findings

01

Symplectic networks with higher order methods exhibit non-vanishing gradients.

02

The proposed architectures demonstrate improved efficiency on dynamical system tasks.

03

The approach combines theoretical guarantees with practical performance.

Abstract

Deep learning is widely used in tasks including image recognition and generation, in learning dynamical systems from data and many more. It is important to construct learning architectures with theoretical guarantees to permit safety in the applications. There has been considerable progress in this direction lately. In particular, symplectic networks were shown to have the non vanishing gradient property, essential for numerical stability. On the other hand, architectures based on higher order numerical methods were shown to be efficient in many tasks where the learned function has an underlying dynamical structure. In this work we construct symplectic networks based on higher order explicit methods with non vanishing gradient property and test their efficiency on various examples.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications