Beyond Object Recognition: A New Benchmark towards Object Concept Learning
Yong-Lu Li, Yue Xu, Xinyu Xu, Xiaohan Mao, Yuan Yao, Siqi Liu, Cewu Lu

TL;DR
This paper introduces a new challenging benchmark for object concept learning that requires AI systems to reason about object attributes and affordances, supported by a densely annotated knowledge base and a causal reasoning model.
Contribution
It proposes the Object Concept Learning task, creates a comprehensive knowledge base, and develops a causal reasoning network to improve understanding of object attributes and affordances.
Findings
OCRN effectively infers object concepts following causal relations
The knowledge base enables detailed object attribute and affordance annotations
The approach demonstrates promising results in causal reasoning for object understanding
Abstract
Understanding objects is a central building block of artificial intelligence, especially for embodied AI. Even though object recognition excels with deep learning, current machines still struggle to learn higher-level knowledge, e.g., what attributes an object has, and what can we do with an object. In this work, we propose a challenging Object Concept Learning (OCL) task to push the envelope of object understanding. It requires machines to reason out object affordances and simultaneously give the reason: what attributes make an object possesses these affordances. To support OCL, we build a densely annotated knowledge base including extensive labels for three levels of object concept (category, attribute, affordance), and the causal relations of three levels. By analyzing the causal structure of OCL, we present a baseline, Object Concept Reasoning Network (OCRN). It leverages causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications
MethodsBalanced Selection
