Image Classification via Random Dilated Convolution with Multi-Branch Feature Extraction and Context Excitation
Wentao Jiang, Yuanchan Xu, Heng Yuan

TL;DR
This paper introduces RDCNet, a novel CNN architecture that enhances image classification by integrating multi-scale feature extraction, noise robustness, and dynamic feature emphasis, achieving state-of-the-art results across multiple datasets.
Contribution
The paper presents RDCNet, combining multi-branch dilated convolutions, fine-grained feature enhancement, and context excitation modules to improve classification accuracy and robustness.
Findings
RDCNet outperforms existing methods on CIFAR-10 and CIFAR-100.
Achieves significant accuracy improvements on SVHN, Imagenette, and Imagewoof.
Demonstrates robustness against noise and irrelevant background interference.
Abstract
Image classification remains a fundamental yet challenging task in computer vision, particularly when fine-grained feature extraction and background noise suppression are required simultaneously. Conventional convolutional neural networks, despite their remarkable success in hierarchical feature learning, often struggle with capturing multi-scale contextual information and are susceptible to overfitting when confronted with noisy or irrelevant image regions. In this paper, we propose RDCNet (Image Classification Network with Random Dilated Convolution), a novel architecture built upon ResNet-34 that integrates three synergistic innovations to address these limitations: (1) a Multi-Branch Random Dilated Convolution (MRDC) module that employs parallel branches with varying dilation rates combined with a stochastic masking mechanism to capture fine-grained features across multiple scales…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
