ImageSubject: A Large-scale Dataset for Subject Detection
Xin Miao, Jiayi Liu, Huayan Wang, Jun Fu

TL;DR
This paper introduces ImageSubject, a large-scale dataset of 107,700 images from movie shots for training models to detect main subjects, highlighting the importance of understanding image layout and context.
Contribution
The creation of ImageSubject, the first dataset focused on localizing the main subject in images, with detailed analysis and comparison to related tasks like saliency and object detection.
Findings
Transformer-based models perform best on subject detection.
The dataset enables better understanding of image layout and context.
Subject detection differs from traditional object detection and saliency tasks.
Abstract
Main subjects usually exist in the images or videos, as they are the objects that the photographer wants to highlight. Human viewers can easily identify them but algorithms often confuse them with other objects. Detecting the main subjects is an important technique to help machines understand the content of images and videos. We present a new dataset with the goal of training models to understand the layout of the objects and the context of the image then to find the main subjects among them. This is achieved in three aspects. By gathering images from movie shots created by directors with professional shooting skills, we collect the dataset with strong diversity, specifically, it contains 107\,700 images from 21\,540 movie shots. We labeled them with the bounding box labels for two classes: subject and non-subject foreground object. We present a detailed analysis of the dataset and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
