Information Screening whilst Exploiting! Multimodal Relation Extraction   with Feature Denoising and Multimodal Topic Modeling

Shengqiong Wu; Hao Fei; Yixin Cao; Lidong Bing; Tat-Seng Chua

arXiv:2305.11719·cs.CV·May 26, 2023·21 cites

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling

Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing, Tat-Seng Chua

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel multimodal relation extraction framework that combines internal-information screening through graph-based denoising and external-information exploitation via multimodal topic modeling, significantly improving performance on benchmark datasets.

Contribution

The proposed framework uniquely integrates graph-based feature denoising with multimodal topic modeling for enhanced relation extraction.

Findings

01

Outperforms current best models on benchmark datasets

02

Effectively denoises less-informative features using graph information bottleneck

03

Enriches context with latent multimodal topic features

Abstract

Existing research on multimodal relation extraction (MRE) faces two co-existing challenges, internal-information over-utilization and external-information under-exploitation. To combat that, we propose a novel framework that simultaneously implements the idea of internal-information screening and external-information exploiting. First, we represent the fine-grained semantic structures of the input image and text with the visual and textual scene graphs, which are further fused into a unified cross-modal graph (CMG). Based on CMG, we perform structure refinement with the guidance of the graph information bottleneck principle, actively denoising the less-informative features. Next, we perform topic modeling over the input image and text, incorporating latent multimodal topic features to enrich the contexts. On the benchmark MRE dataset, our system outperforms the current best model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chocowu/mre-ise
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Text and Document Classification Technologies