An Augmented Backward-Corrected Projector Splitting Integrator for   Dynamical Low-Rank Training

Jonas Kusch; Steffen Schotth\"ofer; and Alexandra Walter

arXiv:2502.03006·math.NA·February 6, 2025

An Augmented Backward-Corrected Projector Splitting Integrator for Dynamical Low-Rank Training

Jonas Kusch, Steffen Schotth\"ofer, and Alexandra Walter

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new low-rank training method for neural networks that reduces computational costs by decreasing QR decompositions while maintaining convergence and robustness.

Contribution

We propose an augmented backward-corrected projector splitting integrator that improves efficiency and convergence in dynamical low-rank training of neural networks.

Findings

01

Reduces the number of QR decompositions needed during training.

02

Ensures convergence to a locally optimal solution.

03

Demonstrates effectiveness on multiple benchmark datasets.

Abstract

Layer factorization has emerged as a widely used technique for training memory-efficient neural networks. However, layer factorization methods face several challenges, particularly a lack of robustness during the training process. To overcome this limitation, dynamical low-rank training methods have been developed, utilizing robust time integration techniques for low-rank matrix differential equations. Although these approaches facilitate efficient training, they still depend on computationally intensive QR and singular value decompositions of matrices with small rank. In this work, we introduce a novel low-rank training method that reduces the number of required QR decompositions. Our approach integrates an augmentation step into a projector-splitting scheme, ensuring convergence to a locally optimal solution. We provide a rigorous theoretical analysis of the proposed method and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ScSteffen/Publication-Augmented-Backward-Corrected-PSI-low-rank-training
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optical Sensing Technologies