Rethinking Uncertainly Missing and Ambiguous Visual Modality in   Multi-Modal Entity Alignment

Zhuo Chen; Lingbing Guo; Yin Fang; Yichi Zhang; Jiaoyan Chen; Jeff Z.; Pan; Yangning Li; Huajun Chen; Wen Zhang

arXiv:2307.16210·cs.AI·August 2, 2023·2 cites

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z., Pan, Yangning Li, Huajun Chen, Wen Zhang

PDF

Open Access 1 Repo

TL;DR

This paper analyzes the challenges of missing and ambiguous visual data in multi-modal entity alignment, introduces a robust method called UMAEA, and demonstrates its superior performance on a new benchmark dataset.

Contribution

The paper presents UMAEA, a novel approach that effectively handles visual modality incompleteness in multi-modal entity alignment, outperforming existing methods on a comprehensive benchmark.

Findings

01

UMAEA achieves state-of-the-art results on 97 benchmark splits.

02

Models tend to overfit noise and perform poorly with high missing modality rates.

03

Additional multi-modal data can sometimes harm entity alignment performance.

Abstract

As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information. However, existing MMEA approaches primarily concentrate on the fusion paradigm of multi-modal entity features, while neglecting the challenges presented by the pervasive phenomenon of missing and intrinsic ambiguity of visual images. In this paper, we present a further analysis of visual modality incompleteness, benchmarking latest MMEA models on our proposed dataset MMEA-UMVM, where the types of alignment KGs covering bilingual and monolingual, with standard (non-iterative) and iterative training paradigms to evaluate the model performance. Our research indicates that, in the face of modality incompleteness, models succumb to overfitting the modality noise, and exhibit performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zjukg/umaea
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning