DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking

Shangyu Xing; Fei Zhao; Zhen Wu; Chunhui Li; Jianbing Zhang; Xinyu Dai

arXiv:2310.05589·cs.CL·October 10, 2023·1 cites

DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking

Shangyu Xing, Fei Zhao, Zhen Wu, Chunhui Li, Jianbing Zhang, Xinyu Dai

PDF

Open Access 1 Repo

TL;DR

This paper introduces DRIN, a novel dynamic network that models fine-grained, relation-specific alignments between mentions and entities in multimodal contexts, significantly improving MEL performance.

Contribution

The paper proposes a dynamic GCN-based framework that explicitly models multiple alignment types and adaptively selects relations for better multimodal entity linking.

Findings

01

DRIN outperforms state-of-the-art methods on two datasets.

02

Explicit relation modeling improves alignment accuracy.

03

Dynamic selection enhances performance on complex data.

Abstract

Multimodal Entity Linking (MEL) is a task that aims to link ambiguous mentions within multimodal contexts to referential entities in a multimodal knowledge base. Recent methods for MEL adopt a common framework: they first interact and fuse the text and image to obtain representations of the mention and entity respectively, and then compute the similarity between them to predict the correct entity. However, these methods still suffer from two limitations: first, as they fuse the features of text and image before matching, they cannot fully exploit the fine-grained alignment relations between the mention and entity. Second, their alignment is static, leading to low performance when dealing with complex and diverse data. To address these issues, we propose a novel framework called Dynamic Relation Interactive Network (DRIN) for MEL tasks. DRIN explicitly models four different types of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

starreeze/drin
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks