Compact Model Training by Low-Rank Projection with Energy Transfer
Kailing Guo, Zhenquan Lin, Canyang Chen, Xiaofen Xing, Fang Liu,, Xiangmin Xu

TL;DR
This paper introduces LRPET, a novel training method for low-rank neural networks that combines stochastic gradient descent with energy transfer to improve training from scratch and maintain model capacity.
Contribution
The paper proposes a new low-rank training approach with energy transfer and BN rectification, enabling effective training from scratch and better low-rank approximation.
Findings
LRPET achieves competitive performance compared to re-training methods.
Energy transfer alleviates gradient vanishing caused by projection.
BN rectification improves low-rank approximation accuracy.
Abstract
Low-rankness plays an important role in traditional machine learning, but is not so popular in deep learning. Most previous low-rank network compression methods compress networks by approximating pre-trained models and re-training. However, the optimal solution in the Euclidean space may be quite different from the one with low-rank constraint. A well-pre-trained model is not a good initialization for the model with low-rank constraints. Thus, the performance of a low-rank compressed network degrades significantly. Compared with other network compression methods such as pruning, low-rank methods attract less attention in recent years. In this paper, we devise a new training method, low-rank projection with energy transfer (LRPET), that trains low-rank compressed networks from scratch and achieves competitive performance. We propose to alternately perform stochastic gradient descent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques · Advanced Image Processing Techniques
MethodsPruning · Batch Normalization
