Better Exploiting OS-CNNs for Better Event Recognition in Images
Limin Wang, Zhe Wang, Sheng Guo, Yu Qiao

TL;DR
This paper improves cultural event recognition in images by leveraging OS-CNNs, combining object and scene features, and exploring various application scenarios, leading to competitive results in a major challenge.
Contribution
It introduces a novel approach of exploiting OS-CNNs as both feature extractors and end-to-end predictors for event recognition, enhancing performance over previous methods.
Findings
OS-CNNs' global and local features are complementary
Proposed scenarios improve event recognition accuracy
Achieved third place in ICCV LAP challenge 2015
Abstract
Event recognition from still images is one of the most important problems for image understanding. However, compared with object recognition and scene recognition, event recognition has received much less research attention in computer vision community. This paper addresses the problem of cultural event recognition in still images and focuses on applying deep learning methods on this problem. In particular, we utilize the successful architecture of Object-Scene Convolutional Neural Networks (OS-CNNs) to perform event recognition. OS-CNNs are composed of object nets and scene nets, which transfer the learned representations from the pre-trained models on large-scale object and scene recognition datasets, respectively. We propose four types of scenarios to explore OS-CNNs for event recognition by treating them as either "end-to-end event predictors" or "generic feature extractors". Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Human Pose and Action Recognition
