A Spectral Nonlocal Block for Neural Networks
Lei Zhu, Qi She, Lidan Zhang, Ping Guo

TL;DR
This paper introduces a spectral nonlocal block for neural networks that captures long-range dependencies more effectively, backed by a theoretical framework and extensive experiments across multiple vision tasks.
Contribution
It provides a unified graph filter interpretation of nonlocal blocks and proposes a new spectral nonlocal block that improves performance in various vision applications.
Findings
Improved accuracy on image classification benchmarks
Enhanced performance in action recognition tasks
Demonstrated robustness and efficiency of the spectral nonlocal block
Abstract
The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks. Although having shown excellent performances, they lack the mechanism to encode the rich, structured information among elements in an image. In this paper, to theoretically analyze the property of these nonlocal-based blocks, we provide a unified approach to interpreting them, where we view them as a graph filter generated on a fully-connected graph. When the graph filter is approximated by Chebyshev polynomials, a generalized formulation can be derived for explaining the existing nonlocal-based blocks ( nonlocal block, nonlocal stage, double attention block). Furthermore, we propose an efficient and robust spectral nonlocal block, which can be flexibly inserted into deep neural networks to catch the long-range dependencies between spatial pixels or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Graph Neural Networks · Sparse and Compressive Sensing Techniques
