Structured learning and detailed interpretation of minimal object images
Guy Ben-Yosef, Liav Assif, Shimon Ullman

TL;DR
This paper presents a structured learning approach to model human interpretation of minimal object images, focusing on identifying semantic features and parts through local regions and primitive relations.
Contribution
It introduces a novel framework that models human-like interpretation by analyzing minimal configurations and their relations, advancing understanding of visual recognition.
Findings
Model successfully predicts human interpretation of minimal images.
Identifies key relations that facilitate local interpretation.
Experimental validation shows improved interpretability predictions.
Abstract
We model the process of human full interpretation of object images, namely the ability to identify and localize all semantic features and parts that are recognized by human observers. The task is approached by dividing the interpretation of the complete object to the interpretation of multiple reduced but interpretable local regions. We model interpretation by a structured learning framework, in which there are primitive components and relations that play a useful role in local interpretation by humans. To identify useful components and relations used in the interpretation process, we consider the interpretation of minimal configurations, namely reduced local regions that are minimal in the sense that further reduction will turn them unrecognizable and uninterpretable. We show experimental results of our model, and results of predicting and testing relations that were useful to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
