Parameter Efficient Deep Neural Networks with Bilinear Projections

Litao Yu; Yongsheng Gao; Jun Zhou; Jian Zhang

arXiv:2011.01391·cs.CV·November 4, 2020

Parameter Efficient Deep Neural Networks with Bilinear Projections

Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang

PDF

1 Repo

TL;DR

This paper introduces bilinear projections in deep neural networks to significantly reduce parameter count and model size, while maintaining or improving accuracy, making models more suitable for memory-limited devices.

Contribution

It proposes a novel bilinear projection method that reduces model complexity from quadratic to linear, addressing parameter redundancy and enabling efficient deep models.

Findings

01

Achieves higher accuracy than full DNNs on benchmarks

02

Reduces model size significantly

03

Maintains or improves accuracy with fewer parameters

Abstract

Recent research on deep neural networks (DNNs) has primarily focused on improving the model accuracy. Given a proper deep learning framework, it is generally possible to increase the depth or layer width to achieve a higher level of accuracy. However, the huge number of model parameters imposes more computational and memory usage overhead and leads to the parameter redundancy. In this paper, we address the parameter redundancy problem in DNNs by replacing conventional full projections with bilinear projections. For a fully-connected layer with $D$ input nodes and $D$ output nodes, applying bilinear projection can reduce the model space complexity from $O (D^{2})$ to $O (2 D)$ , achieving a deep model with a sub-linear layer size. However, structured projection has a lower freedom of degree compared to the full projection, causing the under-fitting problem. So we simply…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yutao1008/Bi-DNNs
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.