OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments
Hayat Ullah, Abbas Khan, Arslan Munir, and Hari Kalva

TL;DR
This paper introduces OD-VIRAT, a large-scale benchmark dataset for object detection in surveillance environments, and evaluates state-of-the-art detection models on it to assess their robustness in complex real-world scenarios.
Contribution
It provides two comprehensive surveillance datasets with extensive annotations and benchmarks multiple recent object detection architectures in challenging conditions.
Findings
State-of-the-art models show varied performance on surveillance data.
Benchmarking reveals challenges in detecting small and occluded objects.
The dataset facilitates future research in robust surveillance object detection.
Abstract
Realistic human surveillance datasets are crucial for training and evaluating computer vision models under real-world conditions, facilitating the development of robust algorithms for human and human-interacting object detection in complex environments. These datasets need to offer diverse and challenging data to enable a comprehensive assessment of model performance and the creation of more reliable surveillance systems for public safety. To this end, we present two visual object detection benchmarks named OD-VIRAT Large and OD-VIRAT Tiny, aiming at advancing visual understanding tasks in surveillance imagery. The video sequences in both benchmarks cover 10 different scenes of human surveillance recorded from significant height and distance. The proposed benchmarks offer rich annotations of bounding boxes and categories, where OD-VIRAT Large has 8.7 million annotated instances in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Visual Attention and Saliency Detection
