Learnable Pooling Regions for Image Classification

Mateusz Malinowski; Mario Fritz

arXiv:1301.3516·cs.CV·May 6, 2015·ICLR·25 cites

Learnable Pooling Regions for Image Classification

Mateusz Malinowski, Mario Fritz

PDF

Open Access

TL;DR

This paper introduces a learnable pooling scheme for image classification that adapts to specific tasks, improving performance over traditional hand-crafted pooling methods on CIFAR datasets.

Contribution

It proposes a flexible, task-dependent pooling model that includes existing schemes as special cases and demonstrates the importance of smooth regularization for optimal performance.

Findings

01

Achieved 56.29% accuracy on CIFAR-100, surpassing previous methods.

02

Demonstrated the effectiveness of learnable pooling over fixed schemes.

03

Provided an efficient, parallel training approach for the model.

Abstract

Biologically inspired, from the early HMAX model to Spatial Pyramid Matching, pooling has played an important role in visual recognition pipelines. Spatial pooling, by grouping of local codes, equips these methods with a certain degree of robustness to translation and deformation yet preserving important spatial information. Despite the predominance of this approach in current recognition systems, we have seen little progress to fully adapt the pooling strategy to the task at hand. This paper proposes a model for learning task dependent pooling scheme -- including previously proposed hand-crafted pooling schemes as a particular instantiation. In our work, we investigate the role of different regularization terms showing that the smooth regularization term is crucial to achieve strong performance using the presented architecture. Finally, we propose an efficient and parallel method to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications