FCOS: Fully Convolutional One-Stage Object Detection
Zhi Tian, Chunhua Shen, Hao Chen, Tong He

TL;DR
FCOS introduces an anchor-free, proposal-free, fully convolutional one-stage object detector that simplifies the detection process while achieving state-of-the-art accuracy, surpassing previous methods.
Contribution
The paper presents FCOS, a novel anchor-free and proposal-free object detection framework that simplifies training and hyper-parameter tuning while improving detection accuracy.
Findings
Achieves 44.7% AP with single-model, single-scale testing.
Outperforms previous one-stage detectors in accuracy.
Simplifies the detection pipeline by removing anchor boxes.
Abstract
We propose a fully convolutional one-stage object detector (FCOS) to solve object detection in a per-pixel prediction fashion, analogue to semantic segmentation. Almost all state-of-the-art object detectors such as RetinaNet, SSD, YOLOv3, and Faster R-CNN rely on pre-defined anchor boxes. In contrast, our proposed detector FCOS is anchor box free, as well as proposal free. By eliminating the predefined set of anchor boxes, FCOS completely avoids the complicated computation related to anchor boxes such as calculating overlapping during training. More importantly, we also avoid all hyper-parameters related to anchor boxes, which are often very sensitive to the final detection performance. With the only post-processing non-maximum suppression (NMS), FCOS with ResNeXt-64x4d-101 achieves 44.7% in AP with single-model and single-scale testing, surpassing previous one-stage detectors with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
MethodsLogistic Regression · Focal Loss · k-Means Clustering · Average Pooling · RetinaNet · ResNeXt Block · BNB Customer Service Number +1-833-534-1729 · YOLOv3 · SSD · Weight Decay
