CNN Mixture-of-Depths
Rinor Cakaj, Jens Mehnert, Bin Yang

TL;DR
The paper presents Mixture-of-Depths, a CNN method that improves computational efficiency by dynamically selecting relevant channels for processing, leading to faster inference without sacrificing accuracy.
Contribution
Introduces a static-graph, channel-selective approach called Mixture-of-Depths for CNNs that enhances efficiency without complex dynamic computation.
Findings
ResNet86-MoD outperforms ResNet50 by 0.45% on ImageNet.
ResNet75-MoD matches ResNet50 performance with 25% CPU speedup.
Method speeds up training and inference without specialized hardware or loss functions.
Abstract
We introduce Mixture-of-Depths (MoD) for Convolutional Neural Networks (CNNs), a novel approach that enhances the computational efficiency of CNNs by selectively processing channels based on their relevance to the current prediction. This method optimizes computational resources by dynamically selecting key channels in feature maps for focused processing within the convolutional blocks (Conv-Blocks), while skipping less relevant channels. Unlike conditional computation methods that require dynamic computation graphs, CNN MoD uses a static computation graph with fixed tensor sizes which improve hardware efficiency. It speeds up the training and inference processes without the need for customized CUDA kernels, unique loss functions, or finetuning. CNN MoD either matches the performance of traditional CNNs with reduced inference times, GMACs, and parameters, or exceeds their performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Digital Imaging for Blood Diseases · Medical Image Segmentation Techniques
