Group and Shuffle: Efficient Structured Orthogonal Parametrization

Mikhail Gorbunov; Nikolay Yudin; Vera Soboleva; Aibek Alanov; Alexey; Naumov; Maxim Rakhuba

arXiv:2406.10019·cs.LG·June 17, 2024

Group and Shuffle: Efficient Structured Orthogonal Parametrization

Mikhail Gorbunov, Nikolay Yudin, Vera Soboleva, Aibek Alanov, Alexey, Naumov, Maxim Rakhuba

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new structured orthogonal parametrization that enhances the efficiency of orthogonal fine-tuning in neural networks, applicable across various domains including text-to-image models and language tasks.

Contribution

It unifies and generalizes structured matrix classes to improve parameter and computational efficiency in orthogonal fine-tuning methods.

Findings

01

Improved efficiency in fine-tuning large models.

02

Successful application to text-to-image diffusion models.

03

Effective adaptation for orthogonal convolutions and 1-Lipschitz networks.

Abstract

The increasing size of neural networks has led to a growing demand for methods of efficient fine-tuning. Recently, an orthogonal fine-tuning paradigm was introduced that uses orthogonal matrices for adapting the weights of a pretrained model. In this paper, we introduce a new class of structured matrices, which unifies and generalizes structured classes from previous works. We examine properties of this class and build a structured orthogonal parametrization upon it. We then use this parametrization to modify the orthogonal fine-tuning framework, improving parameter and computational efficiency. We empirically validate our method on different domains, including adapting of text-to-image diffusion models and downstream task fine-tuning in language modeling. Additionally, we adapt our construction for orthogonal convolutions and conduct experiments with 1-Lipschitz neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

skonor/group_and_shuffle
pytorch

Videos

Group and Shuffle: Efficient Structured Orthogonal Parametrization· slideslive

Taxonomy

TopicsEmbedded Systems Design Techniques · Medical Image Segmentation Techniques · Computational Geometry and Mesh Generation

MethodsDiffusion