Better Exploiting OS-CNNs for Better Event Recognition in Images

Limin Wang; Zhe Wang; Sheng Guo; Yu Qiao

arXiv:1510.03979·cs.CV·October 15, 2015·1 cites

Better Exploiting OS-CNNs for Better Event Recognition in Images

Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao

PDF

Open Access

TL;DR

This paper improves cultural event recognition in images by leveraging OS-CNNs, combining object and scene features, and exploring various application scenarios, leading to competitive results in a major challenge.

Contribution

It introduces a novel approach of exploiting OS-CNNs as both feature extractors and end-to-end predictors for event recognition, enhancing performance over previous methods.

Findings

01

OS-CNNs' global and local features are complementary

02

Proposed scenarios improve event recognition accuracy

03

Achieved third place in ICCV LAP challenge 2015

Abstract

Event recognition from still images is one of the most important problems for image understanding. However, compared with object recognition and scene recognition, event recognition has received much less research attention in computer vision community. This paper addresses the problem of cultural event recognition in still images and focuses on applying deep learning methods on this problem. In particular, we utilize the successful architecture of Object-Scene Convolutional Neural Networks (OS-CNNs) to perform event recognition. OS-CNNs are composed of object nets and scene nets, which transfer the learned representations from the pre-trained models on large-scale object and scene recognition datasets, respectively. We propose four types of scenarios to explore OS-CNNs for event recognition by treating them as either "end-to-end event predictors" or "generic feature extractors". Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Human Pose and Action Recognition