Squeeze-and-Remember Block
Rinor Cakaj, Jens Mehnert, Bin Yang

TL;DR
The paper introduces the Squeeze-and-Remember (SR) block, a novel CNN component that adds dynamic memory-like features, improving accuracy on image datasets with minimal extra computation.
Contribution
The SR block is a new architectural unit that enables CNNs to dynamically memorize and reapply important features, enhancing their contextual understanding.
Findings
Improved ImageNet top-1 accuracy by 0.52% with SR block in ResNet50.
Increased Cityscapes mean IOU by 0.20% using SR in DeepLab v3.
Achieved these improvements with minimal computational overhead.
Abstract
Convolutional Neural Networks (CNNs) are important for many machine learning tasks. They are built with different types of layers: convolutional layers that detect features, dropout layers that help to avoid over-reliance on any single neuron, and residual layers that allow the reuse of features. However, CNNs lack a dynamic feature retention mechanism similar to the human brain's memory, limiting their ability to use learned information in new contexts. To bridge this gap, we introduce the "Squeeze-and-Remember" (SR) block, a novel architectural unit that gives CNNs dynamic memory-like functionalities. The SR block selectively memorizes important features during training, and then adaptively re-applies these features during inference. This improves the network's ability to make contextually informed predictions. Empirical results on ImageNet and Cityscapes datasets demonstrate the SR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsDense Connections · Feedforward Network · Dilated Convolution · Conditional Random Field · Dropout · DeepLab
