# Butterfly Transform: An Efficient FFT Based Neural Architecture Design

**Authors:** Keivan Alizadeh Vahid, Anish Prabhu, Ali Farhadi, Mohammad Rastegari

arXiv: 1906.02256 · 2020-04-20

## TL;DR

This paper introduces the Butterfly Transform (BFT), a neural architecture component inspired by FFT, which replaces costly pointwise convolutions in CNNs, leading to improved accuracy and reduced computational complexity, especially in low FLOP regimes.

## Contribution

The paper proposes BFT, an FFT-inspired transform that optimizes channel fusion in CNNs, reducing FLOP count from O(n^2) to O(n log n) and enhancing performance.

## Key findings

- BFT reduces pointwise convolution FLOPs from O(n^2) to O(n log n).
- BFT improves accuracy by up to 6.75% on ImageNet for MobileNetV1.
- ShuffleNet-V2+BFT outperforms state-of-the-art methods in low FLOP regimes.

## Abstract

In this paper, we show that extending the butterfly operations from the FFT algorithm to a general Butterfly Transform (BFT) can be beneficial in building an efficient block structure for CNN designs. Pointwise convolutions, which we refer to as channel fusions, are the main computational bottleneck in the state-of-the-art efficient CNNs (e.g. MobileNets ). We introduce a set of criteria for channel fusion and prove that BFT yields an asymptotically optimal FLOP count with respect to these criteria. By replacing pointwise convolutions with BFT, we reduce the computational complexity of these layers from O(n^2) to O(n\log n) with respect to the number of channels. Our experimental evaluations show that our method results in significant accuracy gains across a wide range of network architectures, especially at low FLOP ranges. For example, BFT results in up to a 6.75% absolute Top-1 improvement for MobileNetV1, 4.4 \% for ShuffleNet V2 and 5.4% for MobileNetV3 on ImageNet under a similar number of FLOPS. Notably, ShuffleNet-V2+BFT outperforms state-of-the-art architecture search methods MNasNet, FBNet and MobilenetV3 in the low FLOP regime.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.02256/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1906.02256/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/1906.02256/full.md

---
Source: https://tomesphere.com/paper/1906.02256