Heri-Graphs: A Workflow of Creating Datasets for Multi-modal Machine Learning on Graphs of Heritage Values and Attributes with Social Media
Nan Bai, Pirouz Nourian, Renqian Luo, Ana Pereira Roders

TL;DR
This paper introduces Heri-Graphs, a workflow for creating multi-modal datasets from social media data to facilitate graph-based machine learning on cultural heritage values and attributes across different cities.
Contribution
It presents a novel methodological workflow for constructing multi-modal heritage datasets from social media, integrating heterogeneous data into graph structures for ML applications.
Findings
Datasets created for Amsterdam, Suzhou, and Venice show high consistency.
The workflow effectively models visual, textual, social, and spatial data as graph features.
Ready-to-use mathematical formalization for heritage-related ML tasks.
Abstract
Values (why to conserve) and Attributes (what to conserve) are essential concepts of cultural heritage. Recent studies have been using social media to map values and attributes conveyed by public to cultural heritage. However, it is rare to connect heterogeneous modalities of images, texts, geo-locations, timestamps, and social network structures to mine the semantic and structural characteristics therein. This study presents a methodological workflow for constructing such multi-modal datasets using posts and images on Flickr for graph-based machine learning (ML) tasks concerning heritage values and attributes. After data pre-processing using state-of-the-art ML models, the multi-modal information of visual contents and textual semantics are modelled as node features and labels, while their social relationships and spatiotemporal contexts are modelled as links in Multi-Graphs. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Cultural Heritage Management and Preservation
