Graph Mixer Networks

Ahmet Sar{\i}g\"un

arXiv:2301.12493·cs.LG·January 31, 2023

Graph Mixer Networks

Ahmet Sar{\i}g\"un

PDF

Open Access 1 Repo

TL;DR

This paper introduces Graph Mixer Networks, a new architecture inspired by MLP-Mixers, designed to improve computational efficiency and performance in graph-structured data tasks, outperforming existing Graph Transformer models.

Contribution

The paper proposes the Graph Mixer Network (GMN), applying MLP-Mixer principles to graph data, and demonstrates its superior performance over Graph Transformers.

Findings

01

GMN outperforms Graph Transformers in experiments.

02

GMN reduces computational cost compared to Transformer-based models.

03

Source code is publicly available.

Abstract

In recent years, the attention mechanism has demonstrated superior performance in various tasks, leading to the emergence of GAT and Graph Transformer models that utilize this mechanism to extract relational information from graph-structured data. However, the high computational cost associated with the Transformer block, as seen in Vision Transformers, has motivated the development of alternative architectures such as MLP-Mixers, which have been shown to improve performance in image tasks while reducing the computational cost. Despite the effectiveness of Transformers in graph-based tasks, their computational efficiency remains a concern. The logic behind MLP-Mixers, which addresses this issue in image tasks, has the potential to be applied to graph-structured data as well. In this paper, we propose the Graph Mixer Network (GMN), also referred to as Graph Nasreddin Nets (GNasNets), a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asarigun/GraphMixerNetworks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Visual Attention and Saliency Detection · Brain Tumor Detection and Classification

MethodsAttention Is All You Need · Laplacian EigenMap · Linear Layer · Laplacian Positional Encodings · Softmax · Absolute Position Encodings · Graph Transformer · Byte Pair Encoding · Adam · Layer Normalization