Matrix Shuffle-Exchange Networks for Hard 2D Tasks

Em\=ils Ozoli\c{n}\v{s}; K\=arlis Freivalds; Agris \v{S}ostaks

arXiv:2006.15892·cs.LG·October 6, 2020

Matrix Shuffle-Exchange Networks for Hard 2D Tasks

Em\=ils Ozoli\c{n}\v{s}, K\=arlis Freivalds, Agris \v{S}ostaks

PDF

2 Repos

TL;DR

The paper introduces the Matrix Shuffle-Exchange network, a neural model that efficiently captures long-range dependencies in 2D data, surpassing traditional CNNs and GNNs in complex reasoning tasks while maintaining comparable speed.

Contribution

It presents a novel neural architecture derived from Neural Shuffle-Exchange networks with logarithmic depth and complexity, optimized for large-scale 2D reasoning tasks.

Findings

01

Outperforms CNNs and GNNs on matrix and graph reasoning tasks.

02

Maintains full long-range dependency modeling for larger instances.

03

Achieves comparable speed to traditional CNNs.

Abstract

Convolutional neural networks have become the main tools for processing two-dimensional data. They work well for images, yet convolutions have a limited receptive field that prevents its applications to more complex 2D tasks. We propose a new neural model, called Matrix Shuffle-Exchange network, that can efficiently exploit long-range dependencies in 2D data and has comparable speed to a convolutional neural network. It is derived from Neural Shuffle-Exchange network and has $O (lo g n)$ layers and $O (n^{2} lo g n)$ total time and space complexity for processing a $n \times n$ data matrix. We show that the Matrix Shuffle-Exchange network is well-suited for algorithmic and logical reasoning tasks on matrices and dense graphs, exceeding convolutional and graph neural network baselines. Its distinct advantage is the capability of retaining full long-range dependency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Neural Network · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings