MGiaD: Multigrid in all dimensions. Efficiency and robustness by   coarsening in resolution and channel dimensions

Antonia van Betteray; Matthias Rottmann; Karsten Kahl

arXiv:2211.05525·cs.CV·November 11, 2022

MGiaD: Multigrid in all dimensions. Efficiency and robustness by coarsening in resolution and channel dimensions

Antonia van Betteray, Matthias Rottmann, Karsten Kahl

PDF

Open Access

TL;DR

This paper introduces MGiaD, a multigrid-inspired neural network architecture that coarsens in resolution and channel dimensions, achieving linear weight complexity, reducing overfitting, and improving image classification performance.

Contribution

It combines multigrid ideas across all dimensions to create a more efficient and robust neural network architecture with significantly fewer weights.

Findings

01

Reduces weight count compared to traditional architectures.

02

Improves overfitting resistance and generalization.

03

Outperforms ResNet on standard benchmarks.

Abstract

Current state-of-the-art deep neural networks for image classification are made up of 10 - 100 million learnable weights and are therefore inherently prone to overfitting. The complexity of the weight count can be seen as a function of the number of channels, the spatial extent of the input and the number of layers of the network. Due to the use of convolutional layers the scaling of weight complexity is usually linear with regards to the resolution dimensions, but remains quadratic with respect to the number of channels. Active research in recent years in terms of using multigrid inspired ideas in deep neural networks have shown that on one hand a significant number of weights can be saved by appropriate weight sharing and on the other that a hierarchical structure in the channel dimension can improve the weight complexity to linear. In this work, we combine these multigrid ideas to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Residual Connection · Bottleneck Residual Block · Convolution · Max Pooling · Kaiming Initialization · Residual Block · Average Pooling