TL;DR
This paper introduces detail-preserving pooling (DPP), an adaptive method inspired by the human visual system that enhances important structural details during downscaling in deep networks, leading to improved performance.
Contribution
The paper proposes a novel learnable pooling method, DPP, which magnifies spatial changes and preserves structural details, outperforming traditional pooling techniques.
Findings
DPP consistently outperforms max and average pooling on multiple datasets.
DPP's parameters can be learned jointly with the network.
Theoretical analysis supports DPP's ability to preserve important features.
Abstract
Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain distortions, and increase the receptive field size. Since pooling by nature is a lossy process, it is crucial that each such layer maintains the portion of the activations that is most important for the network's discriminability. Yet, simple maximization or averaging over blocks, max or average pooling, or plain downsampling in the form of strided convolutions are the standard. In this paper, we aim to leverage recent results on image downscaling for the purposes of deep learning. Inspired by the human visual system, which focuses on local spatial changes, we propose detail-preserving pooling (DPP), an adaptive pooling method that magnifies spatial changes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
