Untangling Local and Global Deformations in Deep Convolutional Networks for Image Classification and Sliding Window Detection
George Papandreou, Iasonas Kokkinos, Pierre-Andr\'e Savalle

TL;DR
This paper proposes novel methods for handling local and global deformations in deep convolutional networks, introducing epitomic convolution, a multiple instance learning approach with a patchwork structure, and an efficient sliding window detector, leading to improved classification and detection performance.
Contribution
It introduces epitomic convolution for better parameter sharing, a multiple instance learning framework with a patchwork structure for global deformations, and an efficient sliding window detector for object localization.
Findings
Improved classification accuracy on ImageNet.
Competitive object detection results on Pascal VOC 2007.
Faster convergence and better generalization with epitomic convolution.
Abstract
Deep Convolutional Neural Networks (DCNNs) commonly use generic `max-pooling' (MP) layers to extract deformation-invariant features, but we argue in favor of a more refined treatment. First, we introduce epitomic convolution as a building block alternative to the common convolution-MP cascade of DCNNs; while having identical complexity to MP, Epitomic Convolution allows for parameter sharing across different filters, resulting in faster convergence and better generalization. Second, we introduce a Multiple Instance Learning approach to explicitly accommodate global translation and scaling when training a DCNN exclusively with class labels. For this we rely on a `patchwork' data structure that efficiently lays out all image scales and positions as candidates to a DCNN. Factoring global and local deformations allows a DCNN to `focus its resources' on the treatment of non-rigid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsConvolution
