Generalized Max Pooling
Naila Murray, Florent Perronnin

TL;DR
This paper introduces Generalized Max Pooling, a new pooling method that balances the influence of frequent and rare descriptors, improving image classification performance across multiple benchmarks.
Contribution
The paper proposes a novel pooling mechanism applicable beyond traditional count-based methods, especially enhancing Fisher Vector representations.
Findings
GMP achieves significant performance improvements on five image classification benchmarks.
GMP effectively balances the influence of frequent and rare descriptors.
The method is applicable to Fisher Vector and other advanced image representations.
Abstract
State-of-the-art patch-based image representations involve a pooling operation that aggregates statistics computed from local descriptors. Standard pooling operations include sum- and max-pooling. Sum-pooling lacks discriminability because the resulting representation is strongly influenced by frequent yet often uninformative descriptors, but only weakly influenced by rare yet potentially highly-informative ones. Max-pooling equalizes the influence of frequent and rare descriptors but is only applicable to representations that rely on count statistics, such as the bag-of-visual-words (BOV) and its soft- and sparse-coding extensions. We propose a novel pooling mechanism that achieves the same effect as max-pooling but is applicable beyond the BOV and especially to the state-of-the-art Fisher Vector -- hence the name Generalized Max Pooling (GMP). It involves equalizing the similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification · Advanced Neural Network Applications
MethodsMax Pooling
