Geometric Neural Phrase Pooling: Modeling the Spatial Co-occurrence of Neurons
Lingxi Xie, Qi Tian, John Flynn, Jingdong Wang, Alan Yuille

TL;DR
This paper introduces Geometric Neural Phrase Pooling (GNPP), a novel method for modeling spatial co-occurrence of neuron responses in CNNs, leading to improved image classification accuracy.
Contribution
It proposes GNPP, a new pooling layer that encodes neural phrases based on spatial relationships, enhancing CNN performance with minimal computational cost.
Findings
GNPP improves classification accuracy across multiple datasets.
GNPP effectively models spatial co-occurrence of neuron responses.
The method introduces minimal additional computational overhead.
Abstract
Deep Convolutional Neural Networks (CNNs) are playing important roles in state-of-the-art visual recognition. This paper focuses on modeling the spatial co-occurrence of neuron responses, which is less studied in the previous work. For this, we consider the neurons in the hidden layer as neural words, and construct a set of geometric neural phrases on top of them. The idea that grouping neural words into neural phrases is borrowed from the Bag-of-Visual-Words (BoVW) model. Next, the Geometric Neural Phrase Pooling (GNPP) algorithm is proposed to efficiently encode these neural phrases. GNPP acts as a new type of hidden layer, which punishes the isolated neuron responses after convolution, and can be inserted into a CNN model with little extra computational overhead. Experimental results show that GNPP produces significant and consistent accuracy gain in image classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Advanced Neural Network Applications · Image Processing Techniques and Applications
