Self-grouping Convolutional Neural Networks
Qingbei Guo, Xiao-Jun Wu, Josef Kittler, Zhiquan Feng

TL;DR
This paper introduces SG-CNN, a novel self-grouping convolutional neural network method that dynamically groups filters based on importance similarity, leading to improved efficiency and accuracy across various architectures and tasks.
Contribution
The paper proposes a data-dependent, importance-based filter grouping method for CNNs, enabling adaptive, diverse group convolution filters that enhance compression and performance.
Findings
Outperforms existing methods on CIFAR-10/100 and ImageNet datasets.
Compatible with architectures like ResNet and DenseNet, improving efficiency.
Demonstrates effective transfer learning, domain adaptation, and object detection.
Abstract
Although group convolution operators are increasingly used in deep convolutional neural networks to improve the computational efficiency and to reduce the number of parameters, most existing methods construct their group convolution architectures by a predefined partitioning of the filters of each convolutional layer into multiple regular filter groups with an equal spatial group size and data-independence, which prevents a full exploitation of their potential. To tackle this issue, we propose a novel method of designing self-grouping convolutional neural networks, called SG-CNN, in which the filters of each convolutional layer group themselves based on the similarity of their importance vectors. Concretely, for each filter, we first evaluate the importance value of their input channels to identify the importance vectors, and then group these vectors by clustering. Using the resulting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsConcatenated Skip Connection · Dense Block · Batch Normalization · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Residual Block · Convolution · Bottleneck Residual Block · 1x1 Convolution
