Places205-VGGNet Models for Scene Recognition
Limin Wang, Sheng Guo, Weilin Huang, Yu Qiao

TL;DR
This paper trains VGGNet models on the large-scale Places205 dataset for improved scene recognition, achieving state-of-the-art results and demonstrating the models' effectiveness across multiple datasets.
Contribution
It presents the training of three VGGNet models on Places205 dataset and verifies their superior performance on scene recognition tasks.
Findings
Achieved state-of-the-art performance on MIT67, SUN397, and Places205 datasets.
Successfully trained VGGNet-11, VGGNet-13, and VGGNet-16 models for scene recognition.
Models are made publicly available for research use.
Abstract
VGGNets have turned out to be effective for object recognition in still images. However, it is unable to yield good performance by directly adapting the VGGNet models trained on the ImageNet dataset for scene recognition. This report describes our implementation of training the VGGNets on the large-scale Places205 dataset. Specifically, we train three VGGNet models, namely VGGNet-11, VGGNet-13, and VGGNet-16, by using a Multi-GPU extension of Caffe toolbox with high computational efficiency. We verify the performance of trained Places205-VGGNet models on three datasets: MIT67, SUN397, and Places205. Our trained models achieve the state-of-the-art performance on these datasets and are made public available.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
