Improving Performance of Object Detection using the Mechanisms of Visual Recognition in Humans
Amir Ghasemi, Nasrin Bayat, Fatemeh Mottaghian, Akram Bayat

TL;DR
This paper investigates how low-resolution images affect object detection performance and introduces a multi-resolution framework inspired by human visual mechanisms, significantly improving accuracy across various image resolutions.
Contribution
The paper proposes a novel multi-resolution object recognition framework based on human visual recognition mechanisms, outperforming single-resolution models on standard benchmarks.
Findings
Multi-resolution approach improves detection accuracy by 9.14% mAP.
Low resolution negatively impacts recognition performance.
Multi-resolution model maintains robustness across spatial frequencies.
Abstract
Object recognition systems are usually trained and evaluated on high resolution images. However, in real world applications, it is common that the images have low resolutions or have small sizes. In this study, we first track the performance of the state-of-the-art deep object recognition network, Faster- RCNN, as a function of image resolution. The results reveals negative effects of low resolution images on recognition performance. They also show that different spatial frequencies convey different information about the objects in recognition process. It means multi-resolution recognition system can provides better insight into optimal selection of features that results in better recognition of objects. This is similar to the mechanisms of the human visual systems that are able to implement multi-scale representation of a visual scene simultaneously. Then, we propose a multi-resolution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques
