NL-FCOS: Improving FCOS through Non-Local Modules for Object Detection
Lukas Pavez, Jose M. Saavedra Rondo

TL;DR
This paper introduces NL-FCOS, an anchor-free object detection model enhanced with non-local modules to mimic perceptual grouping, achieving state-of-the-art results in clothing detection and handwritten amount recognition.
Contribution
It proposes integrating non-local modules into FCOS to improve feature representation by modeling perceptual grouping without increasing inference time.
Findings
State-of-the-art performance in clothing detection.
Effective enhancement for handwritten amount recognition.
Non-local modules boost feature map cooperation.
Abstract
During the last years, we have seen significant advances in the object detection task, mainly due to the outperforming results of convolutional neural networks. In this vein, anchor-based models have achieved the best results. However, these models require prior information about the aspect and scales of target objects, needing more hyperparameters to fit. In addition, using anchors to fit bounding boxes seems far from how our visual system does the same visual task. Instead, our visual system uses the interactions of different scene parts to semantically identify objects, called perceptual grouping. An object detection methodology closer to the natural model is anchor-free detection, where models like FCOS or Centernet have shown competitive results, but these have not yet exploited the concept of perceptual grouping. Therefore, to increase the effectiveness of anchor-free models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection
MethodsFeature Pyramid Network · 1x1 Convolution · Convolution · Batch Normalization · Non Maximum Suppression · Center Pooling · Deep Layer Aggregation · Cascade Corner Pooling · FCOS · CenterNet
