VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph

Jicheng Yuan; Anh Le-Tuan; Manh Nguyen-Duc; Trung-Kien Tran; Manfred; Hauswirth; Danh Le-Phuoc

arXiv:2309.13610·cs.CV·March 29, 2024

VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph

Jicheng Yuan, Anh Le-Tuan, Manh Nguyen-Duc, Trung-Kien Tran, Manfred, Hauswirth, Danh Le-Phuoc

PDF

Open Access

TL;DR

VisionKG is a knowledge graph framework that unifies and manages diverse visual datasets using Semantic Web technologies, enabling easier access, querying, and semantic enrichment for computer vision research.

Contribution

It introduces a knowledge-based approach to organize visual datasets, linking 40 million entities across 30 datasets, surpassing existing metadata-based methods.

Findings

01

Contains 519 million RDF triples describing 40 million entities

02

Integrates 30 datasets and 4 CV tasks for versatile applications

03

Enables semantic enrichment and efficient querying of visual data

Abstract

The availability of vast amounts of visual data with heterogeneous features is a key factor for developing, testing, and benchmarking of new computer vision (CV) algorithms and architectures. Most visual datasets are created and curated for specific tasks or with limited image data distribution for very specific situations, and there is no unified approach to manage and access them across diverse sources, tasks, and taxonomies. This not only creates unnecessary overheads when building robust visual recognition systems, but also introduces biases into learning systems and limits the capabilities of data-centric AI. To address these problems, we propose the Vision Knowledge Graph (VisionKG), a novel resource that interlinks, organizes and manages visual datasets via knowledge graphs and Semantic Web technologies. It can serve as a unified framework facilitating simple access and querying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques