Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment
Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z., Pan, Yangning Li, Huajun Chen, Wen Zhang

TL;DR
This paper analyzes the challenges of missing and ambiguous visual data in multi-modal entity alignment, introduces a robust method called UMAEA, and demonstrates its superior performance on a new benchmark dataset.
Contribution
The paper presents UMAEA, a novel approach that effectively handles visual modality incompleteness in multi-modal entity alignment, outperforming existing methods on a comprehensive benchmark.
Findings
UMAEA achieves state-of-the-art results on 97 benchmark splits.
Models tend to overfit noise and perform poorly with high missing modality rates.
Additional multi-modal data can sometimes harm entity alignment performance.
Abstract
As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information. However, existing MMEA approaches primarily concentrate on the fusion paradigm of multi-modal entity features, while neglecting the challenges presented by the pervasive phenomenon of missing and intrinsic ambiguity of visual images. In this paper, we present a further analysis of visual modality incompleteness, benchmarking latest MMEA models on our proposed dataset MMEA-UMVM, where the types of alignment KGs covering bilingual and monolingual, with standard (non-iterative) and iterative training paradigms to evaluate the model performance. Our research indicates that, in the face of modality incompleteness, models succumb to overfitting the modality noise, and exhibit performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
