TL;DR
The paper introduces the Matrix Shuffle-Exchange network, a neural model that efficiently captures long-range dependencies in 2D data, surpassing traditional CNNs and GNNs in complex reasoning tasks while maintaining comparable speed.
Contribution
It presents a novel neural architecture derived from Neural Shuffle-Exchange networks with logarithmic depth and complexity, optimized for large-scale 2D reasoning tasks.
Findings
Outperforms CNNs and GNNs on matrix and graph reasoning tasks.
Maintains full long-range dependency modeling for larger instances.
Achieves comparable speed to traditional CNNs.
Abstract
Convolutional neural networks have become the main tools for processing two-dimensional data. They work well for images, yet convolutions have a limited receptive field that prevents its applications to more complex 2D tasks. We propose a new neural model, called Matrix Shuffle-Exchange network, that can efficiently exploit long-range dependencies in 2D data and has comparable speed to a convolutional neural network. It is derived from Neural Shuffle-Exchange network and has layers and total time and space complexity for processing a data matrix. We show that the Matrix Shuffle-Exchange network is well-suited for algorithmic and logical reasoning tasks on matrices and dense graphs, exceeding convolutional and graph neural network baselines. Its distinct advantage is the capability of retaining full long-range dependency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Neural Network · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
