TMTE: Effective Multimodal Graph Learning with Task-aware Modality and Topology Co-evolution

Yinlin Zhu; Xunkai Li; Di Wu; Wang Luo; Miao Hu; Di Wu

arXiv:2603.27723·cs.LG·March 31, 2026

TL;DR

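TMTE is a multimodal graph learning framework that jointly and iteratively optimizes graph topology and multimodal representations toward the target task. It addresses topology quality limitations in real-world multimodal-attributed graphs (MAGs): noisy interactions, missing connections, and task-agnostic relational structures.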

Contribution

It proposes a task-aware co-evolution approach for topology and modality, with a bidirectional coupling mechanism and a closed-loop optimization process.

Findings

01

Achieves state-of-the-art results on 9 MAG datasets and 1 non-graph dataset.

02

Effectively addresses noise and missing connections in real-world MAGs.

03

Demonstrates consistent improvements across diverse graph-centric and modality-centric tasks.

Abstract

Multimodal-attributed graphs (MAGs) are a fundamental data structure for multimodal graph learning (MGL), enabling both graph-centric and modality-centric tasks. However, our empirical analysis reveals inherent topology quality limitations in real-world MAGs, including noisy interactions, missing connections, and task-agnostic relational structures. A single graph derived from generic relationships is therefore unlikely to be universally optimal for diverse downstream tasks. To address this challenge, we propose Task-aware Modality and Topology co-Evolution (TMTE), a novel MGL framework that jointly and iteratively optimizes graph topology and multimodal representations toward the target task. TMTE is motivated by the bidirectional coupling between modality and topology: multimodal attributes induce relational structures, while graph topology shapes modality representations. Concretely,…
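The bidirectional coupling described in the abstract (attributes induce structure, structure shapes representations) can be illustrated with a toy loop. This is a minimal sketch under stated assumptions, not the TMTE method: the abstract is truncated before the concrete design, so the kNN graph construction, the mean-neighbour propagation, and the names `knn_adjacency`, `propagate`, and `co_evolve` are all hypothetical choices made for illustration. The sketch also has no task loss, so it shows only the alternation between topology and representations, not the task-aware part.

```python
import numpy as np

def knn_adjacency(x, k):
    """Attributes -> topology: symmetric kNN graph from cosine similarity."""
    normed = x / (np.linalg.norm(x, axis=1, keepdims=True) + 1e-12)
    sim = normed @ normed.T
    np.fill_diagonal(sim, -np.inf)  # a node is never its own neighbour
    idx = np.argsort(-sim, axis=1)[:, :k]  # top-k most similar nodes per row
    adj = np.zeros_like(sim)
    rows = np.repeat(np.arange(x.shape[0]), k)
    adj[rows, idx.ravel()] = 1.0
    return np.maximum(adj, adj.T)  # symmetrize

def propagate(x, adj, alpha=0.5):
    """Topology -> representations: blend features with the neighbour mean."""
    deg = adj.sum(axis=1, keepdims=True)
    neigh_mean = adj @ x / np.maximum(deg, 1.0)
    return (1 - alpha) * x + alpha * neigh_mean

def co_evolve(x, k=2, steps=3, alpha=0.5):
    """Alternate the two updates (hypothetical stand-in for co-evolution)."""
    adj = knn_adjacency(x, k)
    for _ in range(steps):
        x = propagate(x, adj, alpha)  # refine features on the current graph
        adj = knn_adjacency(x, k)     # rebuild the graph from refined features
    return x, adj

# Four nodes forming two obvious groups in feature space.
features = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
refined, graph = co_evolve(features, k=1, steps=3)
```

In a task-aware variant, the graph rebuild and the feature update would both be driven by a downstream objective rather than by raw similarity alone; that part is left out here because the source does not specify it.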

Code & Models

Repositories

https://anonymous.4open.science/r/TMTE-1873
