Attentional Feature Fusion

Yimian Dai, Fabian Gieseke, Stefan Oehmcke, Yiquan Wu, and Kobus Barnard

arXiv:2009.14082 · cs.CV · November 10, 2020 · 6 citations

TL;DR

This paper introduces a unified attentional feature fusion scheme with multi-scale and iterative attention modules, improving feature integration in neural networks and outperforming state-of-the-art models on CIFAR-100 and ImageNet.

Contribution

It proposes a general attentional feature fusion framework with multi-scale and iterative attention modules, enhancing feature integration across scales and semantics.

Findings

01

Outperforms state-of-the-art on CIFAR-100 and ImageNet

02

Achieves better results with fewer layers or parameters

03

Attention mechanisms improve feature fusion effectiveness

Abstract

Feature fusion, the combination of features from different layers or branches, is an omnipresent part of modern network architectures. It is often implemented via simple operations, such as summation or concatenation, but this might not be the best choice. In this work, we propose a uniform and general scheme, namely attentional feature fusion, which is applicable for most common scenarios, including feature fusion induced by short and long skip connections as well as within Inception layers. To better fuse features of inconsistent semantics and scales, we propose a multi-scale channel attention module, which addresses issues that arise when fusing features given at different scales. We also demonstrate that the initial integration of feature maps can become a bottleneck and that this issue can be alleviated by adding another level of attention, which we refer to as iterative…
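The abstract names the building blocks but does not spell out the computation, so the following is only a minimal NumPy sketch under stated assumptions. It assumes the fusion is a soft selection between two feature maps, `Z = M(X + Y) * X + (1 - M(X + Y)) * Y`, where the attention map `M` sums a global branch (global average pooling followed by a pointwise bottleneck) and a local branch (a per-pixel pointwise bottleneck) before a sigmoid. Batch normalization is omitted, the weights are random, and all function and parameter names (`make_params`, `attention_weights`, `reduction`, and so on) are hypothetical, chosen for illustration rather than taken from the authors' code.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def make_params(channels, reduction, rng):
    """Random illustrative weights for the two branches (hypothetical helper)."""
    hidden = max(channels // reduction, 1)

    def branch():
        # (down-projection, up-projection) of a pointwise bottleneck
        return (0.1 * rng.standard_normal((hidden, channels)),
                0.1 * rng.standard_normal((channels, hidden)))

    return {"global": branch(), "local": branch()}


def pointwise_bottleneck(feat, w_down, w_up):
    """Two 1x1 convolutions with a ReLU in between, applied to a (C, H, W) map."""
    hidden = np.maximum(np.einsum("dc,chw->dhw", w_down, feat), 0.0)
    return np.einsum("cd,dhw->chw", w_up, hidden)


def attention_weights(feat, params):
    """Assumed multi-scale channel attention: global plus local context."""
    # Global branch: one descriptor per channel, shape (C, 1, 1).
    global_ctx = feat.mean(axis=(1, 2), keepdims=True)
    g = pointwise_bottleneck(global_ctx, *params["global"])
    # Local branch: keeps the spatial resolution, shape (C, H, W).
    l = pointwise_bottleneck(feat, *params["local"])
    # Broadcasting adds the global term to every spatial position.
    return sigmoid(g + l)


def attentional_fusion(x, y, params):
    """Soft selection between x and y instead of plain summation."""
    m = attention_weights(x + y, params)
    return m * x + (1.0 - m) * y


def iterative_attentional_fusion(x, y, params_first, params_second):
    """Assumed iterative variant: the initial integration is itself attentional."""
    initial = attentional_fusion(x, y, params_first)
    m = attention_weights(initial, params_second)
    return m * x + (1.0 - m) * y
```

Because the weights lie strictly between 0 and 1, each output element in this sketch is a convex combination of the corresponding elements of the two inputs, which is what distinguishes it from plain summation or concatenation. A call might look like `attentional_fusion(x, y, make_params(8, 4, np.random.default_rng(0)))` for two arrays of shape `(8, H, W)`.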

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning