A Pattern to Align Them All: Integrating Different Modalities to Define Multi-Modal Entities

Gianluca Apriceno; Valentina Tamma; Tania Bailoni; Jacopo de Berardinis; Mauro Dragoni

arXiv:2410.13803 · cs.AI · October 18, 2024

TL;DR

This paper introduces a new ontology design pattern to unify and integrate diverse modalities in Multi-Modal Knowledge Graphs, enhancing reasoning and application across various domains.

Contribution

It proposes an abstract model that separates entity semantics from their physical representations, aiding the harmonization of existing multi-modal ontologies.
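
The separation described above can be illustrated with a minimal sketch. This is not the authors' ontology or its vocabulary: the class names `Entity`, `Realization`, and `Modality`, the identifier `ex:song1`, and the file paths are all hypothetical, chosen only to show an entity's semantics kept apart from the physical carriers that manifest it.

```python
from dataclasses import dataclass, field
from enum import Enum


class Modality(Enum):
    # Modalities named in the abstract; the enum itself is an assumption.
    TEXT = "text"
    IMAGE = "image"
    AUDIO = "audio"
    VIDEO = "video"


@dataclass(frozen=True)
class Realization:
    """A physical representation of an entity in one modality (hypothetical name)."""
    modality: Modality
    location: str  # illustrative file path or URI


@dataclass
class Entity:
    """An entity and the information it conveys, independent of any carrier."""
    identifier: str
    description: str
    realizations: list[Realization] = field(default_factory=list)

    def add_realization(self, realization: Realization) -> None:
        # Attach one more physical manifestation to the same entity.
        self.realizations.append(realization)

    def modalities(self) -> set[Modality]:
        # The set of modalities through which this entity is manifested.
        return {r.modality for r in self.realizations}


# One entity, two manifestations in different modalities.
song = Entity("ex:song1", "A musical work")
song.add_realization(Realization(Modality.AUDIO, "recordings/song1.wav"))
song.add_realization(Realization(Modality.TEXT, "lyrics/song1.txt"))
```

Under this sketch, adding a new modality means adding a `Realization`, while the `Entity` and its description stay unchanged, which is the kind of separation of concerns the contribution statement describes.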

Findings

01. Facilitates integration of multi-modal ontologies

02. Supports reasoning with diverse sensory data

03. Enhances cross-domain applications

Abstract

The ability to reason with and integrate different sensory inputs is the foundation underpinning human intelligence and it is the reason for the growing interest in modelling multi-modal information within Knowledge Graphs. Multi-Modal Knowledge Graphs extend traditional Knowledge Graphs by associating an entity with its possible modal representations, including text, images, audio, and videos, all of which are used to convey the semantics of the entity. Despite the increasing attention that Multi-Modal Knowledge Graphs have received, there is a lack of consensus about the definitions and modelling of modalities, whose definition is often determined by application domains. In this paper, we propose a novel ontology design pattern that captures the separation of concerns between an entity (and the information it conveys), whose semantics can have different manifestations across different…

Code & Models

Repositories

ida-fbk/multimodalpattern (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Speech and dialogue systems

Methods: Softmax · Attention Is All You Need · Ontology