Generalized Sum Pooling for Metric Learning
Yeti Z. Gurbuz, Ozan Sener, A. Ayd{\i}n Alatan

TL;DR
This paper introduces a learnable generalized sum pooling (GSP) method for deep metric learning, enhancing global average pooling by enabling selective focus on semantic features and importance weighting, leading to improved performance.
Contribution
It proposes a novel GSP method based on entropy-smoothed optimal transport that generalizes GAP and is fully differentiable for end-to-end learning.
Findings
GSP outperforms GAP on 4 metric learning benchmarks.
The method effectively learns to ignore nuisance information.
GSP provides a flexible, learnable pooling alternative to traditional GAP.
Abstract
A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as representing a different semantic entity and GAP as a convex combination of them. Following this perspective, we generalize GAP and propose a learnable generalized sum pooling method (GSP). GSP improves GAP with two distinct abilities: i) the ability to choose a subset of semantic entities, effectively learning to ignore nuisance information, and ii) learning the weights corresponding to the importance of each entity. Formally, we propose an entropy-smoothed optimal transport problem and show that it is a strict generalization of GAP, i.e., a specific realization of the problem gives back GAP. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Generalized Sum Pooling for Metric Learning· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsAverage Pooling · Global Average Pooling
