Places205-VGGNet Models for Scene Recognition

Limin Wang; Sheng Guo; Weilin Huang; Yu Qiao

arXiv:1508.01667·cs.CV·August 10, 2015·134 cites

Places205-VGGNet Models for Scene Recognition

Limin Wang, Sheng Guo, Weilin Huang, Yu Qiao

PDF

Open Access 2 Repos

TL;DR

This paper trains VGGNet models on the large-scale Places205 dataset for improved scene recognition, achieving state-of-the-art results and demonstrating the models' effectiveness across multiple datasets.

Contribution

It presents the training of three VGGNet models on Places205 dataset and verifies their superior performance on scene recognition tasks.

Findings

01

Achieved state-of-the-art performance on MIT67, SUN397, and Places205 datasets.

02

Successfully trained VGGNet-11, VGGNet-13, and VGGNet-16 models for scene recognition.

03

Models are made publicly available for research use.

Abstract

VGGNets have turned out to be effective for object recognition in still images. However, it is unable to yield good performance by directly adapting the VGGNet models trained on the ImageNet dataset for scene recognition. This report describes our implementation of training the VGGNets on the large-scale Places205 dataset. Specifically, we train three VGGNet models, namely VGGNet-11, VGGNet-13, and VGGNet-16, by using a Multi-GPU extension of Caffe toolbox with high computational efficiency. We verify the performance of trained Places205-VGGNet models on three datasets: MIT67, SUN397, and Places205. Our trained models achieve the state-of-the-art performance on these datasets and are made public available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning