Power Normalizations in Fine-grained Image, Few-shot Image and Graph Classification
Piotr Koniusz, Hongguang Zhang

TL;DR
This paper introduces a novel power normalization layer for deep learning that leverages second-order pooling and spectral methods, improving feature representation in image and graph classification tasks.
Contribution
It proposes a new power normalization layer with probabilistic and spectral interpretations, including a fast spectral MaxExp variant, for enhanced feature pooling in deep networks.
Findings
Improved classification accuracy on fine-grained, scene, and material datasets.
Effective in few-shot learning and graph classification scenarios.
Provides theoretical insights into power normalization functions and spectral properties.
Abstract
Power Normalizations (PN) are useful non-linear operators which tackle feature imbalances in classification problems. We study PNs in the deep learning setup via a novel PN layer pooling feature maps. Our layer combines the feature vectors and their respective spatial locations in the feature maps produced by the last convolutional layer of CNN into a positive definite matrix with second-order statistics to which PN operators are applied, forming so-called Second-order Pooling (SOP). As the main goal of this paper is to study Power Normalizations, we investigate the role and meaning of MaxExp and Gamma, two popular PN functions. To this end, we provide probabilistic interpretations of such element-wise operators and discover surrogates with well-behaved derivatives for end-to-end training. Furthermore, we look at the spectral applicability of MaxExp and Gamma by studying Spectral Power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
