Persistent Evidence of Local Image Properties in Generic ConvNets
Ali Sharif Razavian, Hossein Azizpour, Atsuto Maki, Josephine, Sullivan, Carl Henrik Ek, Stefan Carlsson

TL;DR
This paper reveals that early layers of convolutional networks retain strong spatial information, which can be exploited for various visual tasks, challenging the assumption that training discards auxiliary image details.
Contribution
The study demonstrates persistent local image properties in generic ConvNets' early layers across multiple tasks, highlighting potential for improved training strategies.
Findings
Strong spatial information exists in early ConvNet layers.
Global image descriptors can predict landmarks and pixel labels.
Spatial info can enhance correspondence and training methods.
Abstract
Supervised training of a convolutional network for object classification should make explicit any information related to the class of objects and disregard any auxiliary information associated with the capture of the image or the variation within the object class. Does this happen in practice? Although this seems to pertain to the very final layers in the network, if we look at earlier layers we find that this is not the case. Surprisingly, strong spatial information is implicit. This paper addresses this, in particular, exploiting the image representation at the first fully connected layer, i.e. the global image descriptor which has been recently shown to be most effective in a range of visual recognition tasks. We empirically demonstrate evidences for the finding in the contexts of four different tasks: 2d landmark detection, 2d object keypoints prediction, estimation of the RGB…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
