Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models

Yang Zhang,Nada Mimouni,Jean-Claude Moissinac,Fay\c{c}al Hamdi

arXiv:2605.17669·cs.AI·May 19, 2026

Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models

Yang Zhang,Nada Mimouni,Jean-Claude Moissinac,Fay\c{c}al Hamdi

PDF

1 Repo

TL;DR

This paper presents a multimodal knowledge graph extension approach for cultural heritage data, integrating text and images with large language and vision models, demonstrated on the French WJoconde dataset.

Contribution

It introduces a new multimodal cultural heritage knowledge graph, variants for research, a benchmark for knowledge graph completion, and a framework leveraging LLMs and VLMs for KG extension.

Findings

01

Enhanced KGs with high reliability through multimodal data integration

02

Built a comprehensive benchmark for KGC on cultural heritage data

03

Open-sourced code, datasets, and interactive data access

Abstract

The preservation and interpretation of cultural heritage increasingly rely on digital technologies, among which Knowledge Graphs (KGs) stand out for their ability to structure vast amounts of data. However, the construction and expansion of these KGs often face challenges due to the diverse and complex nature of cultural heritage information. In this paper, we propose a novel approach for extending KG resources in the domain of cultural heritage, which we applied to French data. First, we introduce a new knowledge graph in the domain of French cultural heritage, WJoconde, which is distinguished by its multimodality as it integrates both textual and image information of the entities. We further introduce three variants of WJoconde to facilitate downstream research, such as Knowledge Graph Completion (KGC). We also built a comprehensive benchmark for KGC methods on our dataset. Second, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.