Feature grouping from spatially constrained multiplicative interaction
Felix Bauer, Roland Memisevic

TL;DR
This paper introduces a Gated Boltzmann Machine that learns spatially constrained feature relationships in images, explaining feature grouping and reducing parameters in transformation models.
Contribution
It proposes a novel spatially constrained gating mechanism in a Gated Boltzmann Machine for encoding image relationships, with insights into feature grouping and regularization.
Findings
Frequency and orientation columns emerge from training.
Topographic filter maps naturally form.
Spatial gating reduces parameters and regularizes models.
Abstract
We present a feature learning model that learns to encode relationships between images. The model is defined as a Gated Boltzmann Machine, which is constrained such that hidden units that are nearby in space can gate each other's connections. We show how frequency/orientation "columns" as well as topographic filter maps follow naturally from training the model on image pairs. The model also helps explain why square-pooling models yield feature groups with similar grouping properties. Experimental results on synthetic image transformations show that spatially constrained gating is an effective way to reduce the number of parameters and thereby to regularize a transformation-learning model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Visual perception and processing mechanisms · Advanced Vision and Imaging
