A model for full local image interpretation
Guy Ben-Yosef, Liav Assif, Daniel Harari, Shimon Ullman

TL;DR
This paper proposes a computational model that enhances scene interpretation by combining initial recognition with class-specific validation, aiming to better emulate human detailed visual understanding.
Contribution
It introduces a novel two-stage model integrating recognition and interpretation to improve scene analysis beyond current feed-forward approaches.
Findings
The model achieves richer scene interpretation compared to traditional methods.
It highlights the importance of top-down processes in visual recognition.
The approach suggests pathways for improving computer vision systems.
Abstract
We describe a computational model of humans' ability to provide a detailed interpretation of components in a scene. Humans can identify in an image meaningful components almost everywhere, and identifying these components is an essential part of the visual process, and of understanding the surrounding scene and its potential meaning to the viewer. Detailed interpretation is beyond the scope of current models of visual recognition. Our model suggests that this is a fundamental limitation, related to the fact that existing models rely on feed-forward but limited top-down processing. In our model, a first recognition stage leads to the initial activation of class candidates, which is incomplete and with limited accuracy. This stage then triggers the application of class-specific interpretation and validation processes, which recover richer and more accurate interpretation of the visible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection · Image Retrieval and Classification Techniques
