Studying inductive biases in image classification task
Nana Arizumi

TL;DR
This paper investigates the role of inductive biases in image classification, comparing CNNs and self-attention structures, and introduces CADA to analyze how local and spatial invariance biases affect performance.
Contribution
The study introduces context-aware decomposed attention (CADA) to dissect inductive biases in local self-attention networks and compares their effectiveness to CNNs on ImageNet.
Findings
Context awareness is crucial for local self-attention performance.
Large local information is not necessary for effective CA parameters.
Relaxed spatial invariance improves accuracy over strict invariance.
Abstract
Recently, self-attention (SA) structures became popular in computer vision fields. They have locally independent filters and can use large kernels, which contradicts the previously popular convolutional neural networks (CNNs). CNNs success was attributed to the hard-coded inductive biases of locality and spatial invariance. However, recent studies have shown that inductive biases in CNNs are too restrictive. On the other hand, the relative position encodings, similar to depthwise (DW) convolution, are necessary for the local SA networks, which indicates that the SA structures are not entirely spatially variant. Hence, we would like to determine which part of inductive biases contributes to the success of the local SA structures. To do so, we introduced context-aware decomposed attention (CADA), which decomposes attention maps into multiple trainable base kernels and accumulates them…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Remote-Sensing Image Classification
MethodsConvolution · Balanced Selection
