You Better Look Twice: a new perspective for designing accurate detectors with reduced computations
Alexandra Dana, Maor Shutman, Yotam Perlitz, Ran Vitek, Tomer Peleg,, Roy J Jevnisek

TL;DR
BLT-net is a novel two-stage object detection architecture that significantly reduces computational costs by efficiently separating background from objects and dynamically adjusting processing resolution, especially effective for scenes with varied object sizes.
Contribution
Introduces BLT-net, a low-computation, two-stage detection architecture that improves efficiency by separating background and dynamically adjusting resolution based on object scale.
Findings
Reduces computations by a factor of 4-7 on pedestrian datasets.
Maintains high detection accuracy with minimal degradation.
Applicable to scenes with high background and varied object sizes.
Abstract
General object detectors use powerful backbones that uniformly extract features from images for enabling detection of a vast amount of object types. However, utilization of such backbones in object detection applications developed for specific object types can unnecessarily over-process an extensive amount of background. In addition, they are agnostic to object scales, thus redundantly process all image regions at the same resolution. In this work we introduce BLT-net, a new low-computation two-stage object detection architecture designed to process images with a significant amount of background and objects of variate scales. BLT-net reduces computations by separating objects from background using a very lite first-stage. BLT-net then efficiently merges obtained proposals to further decrease processed background and then dynamically reduces their resolution to minimize computations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
