XS-VID: An Extremely Small Video Object Detection Dataset
Jiahao Guo, Ziyang Xu, Lianjun Wu, Fei Gao, Wenyu Liu, Xinggang Wang

TL;DR
The paper introduces XS-VID, a new dataset for small video object detection focusing on extremely small objects, and proposes YOLOFT, a method that improves detection accuracy for these challenging cases.
Contribution
It provides the first extensive dataset for extremely small objects in video detection and develops YOLOFT, a novel detection method that enhances feature association and temporal integration.
Findings
Existing methods underperform on small object detection.
XS-VID covers diverse scenes and tiny objects, enriching the dataset resources.
YOLOFT significantly improves detection accuracy and stability.
Abstract
Small Video Object Detection (SVOD) is a crucial subfield in modern computer vision, essential for early object discovery and detection. However, existing SVOD datasets are scarce and suffer from issues such as insufficiently small objects, limited object categories, and lack of scene diversity, leading to unitary application scenarios for corresponding methods. To address this gap, we develop the XS-VID dataset, which comprises aerial data from various periods and scenes, and annotates eight major object categories. To further evaluate existing methods for detecting extremely small objects, XS-VID extensively collects three types of objects with smaller pixel areas: extremely small (\textit{es}, ), relatively small (\textit{rs}, ), and generally small (\textit{gs}, ). XS-VID offers unprecedented breadth and depth in covering and quantifying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Brain Tumor Detection and Classification
