Learning on Multimodal Graphs: A Survey

Ciyuan Peng; Jiayuan He; Feng Xia

arXiv:2402.05322 · cs.LG · February 9, 2024 · 2 cites

Open Access

TL;DR

This survey reviews the rapidly growing field of multimodal graph learning, analyzing various techniques, applications, and future directions to serve as a foundational resource for researchers in the domain.

Contribution

It provides a comprehensive comparative analysis of existing multimodal graph learning methods, highlighting their characteristics and application scenarios.

Findings

01. Diverse graph data types and modalities are effectively integrated.

02. Various learning techniques are characterized and compared.

03. Key applications across domains are identified and discussed.

Abstract

Multimodal data pervades various domains, including healthcare, social media, and transportation, where multimodal graphs play a pivotal role. Machine learning on multimodal graphs, referred to as multimodal graph learning (MGL), is essential for successful artificial intelligence (AI) applications. The burgeoning research in this field encompasses diverse graph data types and modalities, learning techniques, and application scenarios. This survey paper conducts a comparative analysis of existing works in multimodal graph learning, elucidating how multimodal learning is achieved across different graph types and exploring the characteristics of prevalent learning techniques. Additionally, we delineate significant applications of multimodal graph learning and offer insights into future directions in this domain. Consequently, this paper serves as a foundational resource for researchers…

Taxonomy

Topics: Text and Document Classification Technologies · Speech and dialogue systems