Featurized Query R-CNN
Wenqiang Zhang, Tianheng Cheng, Xinggang Wang, Shaoyu Chen, and Qian Zhang, Wenyu Liu

TL;DR
Featurized Query R-CNN introduces a query generation network to produce object queries within the Faster R-CNN framework, reducing computation and improving generalization, achieving superior speed-accuracy trade-offs on COCO.
Contribution
The paper proposes a novel featurized query generation method integrated into Faster R-CNN, addressing efficiency and generalization issues in query-based object detection.
Findings
Outperforms state-of-the-art R-CNN detectors on COCO
Reduces computational burden compared to multi-stage decoders
Enhances generalization of object queries
Abstract
The query mechanism introduced in the DETR method is changing the paradigm of object detection and recently there are many query-based methods have obtained strong object detection performance. However, the current query-based detection pipelines suffer from the following two issues. Firstly, multi-stage decoders are required to optimize the randomly initialized object queries, incurring a large computation burden. Secondly, the queries are fixed after training, leading to unsatisfying generalization capability. To remedy the above issues, we present featurized object queries predicted by a query generation network in the well-established Faster R-CNN framework and develop a Featurized Query R-CNN. Extensive experiments on the COCO dataset show that our Featurized Query R-CNN obtains the best speed-accuracy trade-off among all R-CNN detectors, including the recent state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Big Data and Digital Economy
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding · Dropout · Adam · Residual Connection · Position-Wise Feed-Forward Layer
