VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph
Jicheng Yuan, Anh Le-Tuan, Manh Nguyen-Duc, Trung-Kien Tran, Manfred, Hauswirth, Danh Le-Phuoc

TL;DR
VisionKG is a knowledge graph framework that unifies and manages diverse visual datasets using Semantic Web technologies, enabling easier access, querying, and semantic enrichment for computer vision research.
Contribution
It introduces a knowledge-based approach to organize visual datasets, linking 40 million entities across 30 datasets, surpassing existing metadata-based methods.
Findings
Contains 519 million RDF triples describing 40 million entities
Integrates 30 datasets and 4 CV tasks for versatile applications
Enables semantic enrichment and efficient querying of visual data
Abstract
The availability of vast amounts of visual data with heterogeneous features is a key factor for developing, testing, and benchmarking of new computer vision (CV) algorithms and architectures. Most visual datasets are created and curated for specific tasks or with limited image data distribution for very specific situations, and there is no unified approach to manage and access them across diverse sources, tasks, and taxonomies. This not only creates unnecessary overheads when building robust visual recognition systems, but also introduces biases into learning systems and limits the capabilities of data-centric AI. To address these problems, we propose the Vision Knowledge Graph (VisionKG), a novel resource that interlinks, organizes and manages visual datasets via knowledge graphs and Semantic Web technologies. It can serve as a unified framework facilitating simple access and querying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
