SeKron: A Decomposition Method Supporting Many Factorization Structures
Marawan Gamal Abdel Hameed, Ali Mosleh, Marzieh S. Tahaei, Vahid, Partovi Nia

TL;DR
SeKron introduces a versatile tensor decomposition technique using Kronecker products, enabling flexible CNN compression that outperforms existing methods across various vision tasks.
Contribution
SeKron provides a unified framework for multiple factorization structures, generalizing existing decompositions and improving CNN model compression efficiency.
Findings
Outperforms state-of-the-art tensor decompositions in CNN compression
Supports a wide variety of factorization structures
Effective on both high-level and low-level vision tasks
Abstract
While convolutional neural networks (CNNs) have become the de facto standard for most image processing and computer vision applications, their deployment on edge devices remains challenging. Tensor decomposition methods provide a means of compressing CNNs to meet the wide range of device constraints by imposing certain factorization structures on their convolution tensors. However, being limited to the small set of factorization structures presented by state-of-the-art decomposition approaches can lead to sub-optimal performance. We propose SeKron, a novel tensor decomposition method that offers a wide variety of factorization structures, using sequences of Kronecker products. By recursively finding approximating Kronecker factors, we arrive at optimal decompositions for each of the factorization structures. We show that SeKron is a flexible decomposition that generalizes widely used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques · Advanced Neural Network Applications
MethodsTuckER · Convolution
