Poly-NL: Linear Complexity Non-local Layers with Polynomials
Francesca Babiloni, Ioannis Marras, Filippos Kokkinos, Jiankang Deng,, Grigorios Chrysos, Stefanos Zafeiriou

TL;DR
Poly-NL introduces a novel linear complexity non-local layer for neural networks by framing them as polynomial functions, enabling efficient long-range dependency modeling without performance loss.
Contribution
The paper proposes Poly-NL, a new approach that reformulates non-local layers as polynomial functions, achieving linear complexity while maintaining full expressiveness.
Findings
Poly-NL reduces complexity from quadratic to linear.
Poly-NL matches state-of-the-art performance in various tasks.
Poly-NL has significantly less computational overhead.
Abstract
Spatial self-attention layers, in the form of Non-Local blocks, introduce long-range dependencies in Convolutional Neural Networks by computing pairwise similarities among all possible positions. Such pairwise functions underpin the effectiveness of non-local layers, but also determine a complexity that scales quadratically with respect to the input size both in space and time. This is a severely limiting factor that practically hinders the applicability of non-local blocks to even moderately sized inputs. Previous works focused on reducing the complexity by modifying the underlying matrix operations, however in this work we aim to retain full expressiveness of non-local layers while keeping complexity linear. We overcome the efficiency limitation of non-local blocks by framing them as special cases of 3rd order polynomial functions. This fact enables us to formulate novel fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
