A Spectral Nonlocal Block for Neural Networks

Lei Zhu; Qi She; Lidan Zhang; Ping Guo

arXiv:1911.01059·cs.CV·February 11, 2020·1 cites

A Spectral Nonlocal Block for Neural Networks

Lei Zhu, Qi She, Lidan Zhang, Ping Guo

PDF

Open Access

TL;DR

This paper introduces a spectral nonlocal block for neural networks that captures long-range dependencies more effectively, backed by a theoretical framework and extensive experiments across multiple vision tasks.

Contribution

It provides a unified graph filter interpretation of nonlocal blocks and proposes a new spectral nonlocal block that improves performance in various vision applications.

Findings

01

Improved accuracy on image classification benchmarks

02

Enhanced performance in action recognition tasks

03

Demonstrated robustness and efficiency of the spectral nonlocal block

Abstract

The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks. Although having shown excellent performances, they lack the mechanism to encode the rich, structured information among elements in an image. In this paper, to theoretically analyze the property of these nonlocal-based blocks, we provide a unified approach to interpreting them, where we view them as a graph filter generated on a fully-connected graph. When the graph filter is approximated by Chebyshev polynomials, a generalized formulation can be derived for explaining the existing nonlocal-based blocks ( $e.g.,$ nonlocal block, nonlocal stage, double attention block). Furthermore, we propose an efficient and robust spectral nonlocal block, which can be flexibly inserted into deep neural networks to catch the long-range dependencies between spatial pixels or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Advanced Graph Neural Networks · Sparse and Compressive Sensing Techniques