Building a visual semantics aware object hierarchy

Xiaolei Diao

arXiv:2202.13021·cs.CV·March 1, 2022

Building a visual semantics aware object hierarchy

Xiaolei Diao

PDF

Open Access

TL;DR

This paper introduces an unsupervised approach to construct a visual semantics aware object hierarchy that reduces linguistic bias and improves object recognition by learning from visual features alone.

Contribution

It presents a novel unsupervised method for building a visual semantic hierarchy to enhance classification and address semantic gap issues in computer vision.

Findings

01

The hierarchy improves object recognition accuracy.

02

The visual hierarchy outperforms lexical hierarchies.

03

Preliminary results show the method's efficiency.

Abstract

The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated images are extensively used for training machine learning models, semantic gap problem (SGP) also results in inevitable bias on image annotations and further leads to poor performance on current computer vision tasks. To address this problem, we propose a novel unsupervised method to build visual semantics aware object hierarchy, aiming to get a classification model by learning from pure-visual information and to dissipate the bias of linguistic representations caused by SGP. Our intuition in this paper comes from real-world knowledge representation where concepts are hierarchically organized, and each concept can be described by a set of features rather…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsAttentive Walk-Aggregating Graph Neural Network