
TL;DR
This paper introduces Graph Mixer Networks, a new architecture inspired by MLP-Mixers, designed to improve computational efficiency and performance in graph-structured data tasks, outperforming existing Graph Transformer models.
Contribution
The paper proposes the Graph Mixer Network (GMN), applying MLP-Mixer principles to graph data, and demonstrates its superior performance over Graph Transformers.
Findings
GMN outperforms Graph Transformers in experiments.
GMN reduces computational cost compared to Transformer-based models.
Source code is publicly available.
Abstract
In recent years, the attention mechanism has demonstrated superior performance in various tasks, leading to the emergence of GAT and Graph Transformer models that utilize this mechanism to extract relational information from graph-structured data. However, the high computational cost associated with the Transformer block, as seen in Vision Transformers, has motivated the development of alternative architectures such as MLP-Mixers, which have been shown to improve performance in image tasks while reducing the computational cost. Despite the effectiveness of Transformers in graph-based tasks, their computational efficiency remains a concern. The logic behind MLP-Mixers, which addresses this issue in image tasks, has the potential to be applied to graph-structured data as well. In this paper, we propose the Graph Mixer Network (GMN), also referred to as Graph Nasreddin Nets (GNasNets), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Visual Attention and Saliency Detection · Brain Tumor Detection and Classification
MethodsAttention Is All You Need · Laplacian EigenMap · Linear Layer · Laplacian Positional Encodings · Softmax · Absolute Position Encodings · Graph Transformer · Byte Pair Encoding · Adam · Layer Normalization
