Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training
Ipsita Ghosh, Ethan Nguyen, Christian K\"ummerle

TL;DR
The paper introduces Q3R, a novel low-rank regularizer inspired by IRLS, enabling effective low-rank training of deep models with minimal performance loss and computational overhead.
Contribution
Q3R is a new quadratic regularizer that induces low-rank structure in model weights, allowing prescribed low-rank training while maintaining competitive accuracy.
Findings
Q3R enables training to specific low ranks with minor accuracy drops.
Q3R achieves comparable performance to dense models with reduced parameters.
Effective on vision and language transformer models.
Abstract
Parameter-efficient training based on low-rank optimization has become a highly successful tool for fine-tuning large deep learning models. However, these methods often fail for low-rank pre-training, where simultaneously maintaining low-rank weight structure and optimizing the task objective remains challenging. We propose the (), which leads to a novel low-rank-inducing training strategy inspired by the Iteratively Reweighted Least Squares (IRLS) framework. is based on a quadratic regularizer term that majorizes a smoothed log-determinant rank surrogate. Unlike other low-rank training techniques, can train weight matrices to prescribed low target ranks while achieving predictive performance comparable to dense models, with small computational overhead and full compatibility with existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
