TL;DR
SHOP is a pipeline that enhances small handheld object detection in blurry videos by deblurring, pose-based interest region proposal, and filtering, significantly reducing false positives with minimal true positive loss.
Contribution
The paper introduces a flexible, multi-stage pipeline specifically designed for handheld object detection in blurry videos, improving accuracy over existing models.
Findings
70% reduction in false positives
17% decrease in true positives
Effective on a new handheld object subset of MS COCO
Abstract
While prior works have investigated and developed computational models capable of object detection, models still struggle to reliably interpret images with motion blur and small objects. Moreover, none of these models are specifically designed for handheld object detection. In this work, we present SHOP (Small Handheld Object Pipeline), a pipeline that reliably and efficiently interprets blurry images containing handheld objects. The specific models used in each stage of the pipeline are flexible and can be changed based on performance requirements. First, images are deblurred and then run through a pose detection system where areas-of-interest are proposed around the hands of any people present. Next, object detection is performed on the images by a single-stage object detector. Finally, the proposed areas-of-interest are used to filter out low confidence detections. Testing on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
