Behavior and performance of the deep belief networks on image classification
Karol Gregor, Gregory Griffin

TL;DR
This paper investigates the behavior and performance of deep belief networks on image classification tasks, demonstrating benefits of pre-training, transferability of learned features, and comparing results with other methods.
Contribution
It shows that pre-training accelerates learning, can be done on different datasets, and that deep belief networks can effectively classify images with competitive performance.
Findings
Pre-training speeds up supervised learning.
Pre-training on different datasets still yields good performance.
Deep belief networks perform comparably to SVMs and spatial pyramidal matching.
Abstract
We apply deep belief networks of restricted Boltzmann machines to bags of words of sift features obtained from databases of 13 Scenes, 15 Scenes and Caltech 256 and study experimentally their behavior and performance. We find that the final performance in the supervised phase is reached much faster if the system is pre-trained. Pre-training the system on a larger dataset keeping the supervised dataset fixed improves the performance (for the 13 Scenes case). After the unsupervised pre-training, neurons arise that form approximate explicit representations for several categories (meaning they are mostly active for this category). The last three facts suggest that unsupervised training really discovers structure in these data. Pre-training can be done on a completely different dataset (we use Corel dataset) and we find that the supervised phase performs just as good (on the 15 Scenes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
