Large Field and High Resolution: Detecting Needle in Haystack
Hadar Gorodissky, Daniel Harari, Shimon Ullman

TL;DR
This paper introduces variable resolution CNN methods inspired by human vision for small target localization in large images, achieving high accuracy with significantly fewer samples than traditional full-resolution models.
Contribution
The paper proposes and compares variable resolution sampling schemes for CNNs, demonstrating their superiority over constant resolution and full-resolution models in target localization tasks.
Findings
Variable resolution models outperform constant resolution models.
Multi-channel variable resolution models outperform full-resolution models.
Achieve accurate localization using only 5% of the samples.
Abstract
The growing use of convolutional neural networks (CNN) for a broad range of visual tasks, including tasks involving fine details, raises the problem of applying such networks to a large field of view, since the amount of computations increases significantly with the number of pixels. To deal effectively with this difficulty, we develop and compare methods of using CNNs for the task of small target localization in natural images, given a limited "budget" of samples to form an image. Inspired in part by human vision, we develop and compare variable sampling schemes, with peak resolution at the center and decreasing resolution with eccentricity, applied iteratively by re-centering the image at the previous predicted target location. The results indicate that variable resolution models significantly outperform constant resolution models. Surprisingly, variable resolution models and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
