Improving CNN-based Planar Object Detection with Geometric Prior   Knowledge

Jianxiong Cai; Jiawei Hou; Yiren Lu; Hongyu Chen; Laurent Kneip and; S\"oren Schwertfeger

arXiv:1909.10245·cs.RO·April 27, 2021

Improving CNN-based Planar Object Detection with Geometric Prior Knowledge

Jianxiong Cai, Jiawei Hou, Yiren Lu, Hongyu Chen, Laurent Kneip and, S\"oren Schwertfeger

PDF

TL;DR

This paper introduces a novel image rectification method using geometric prior knowledge to improve CNN-based planar object detection, reducing training data needs and enhancing accuracy for mobile robots with RGB-D sensors.

Contribution

It presents the first image rectification approach for CNN-based object detection and provides a new RGB-D hazmat sign dataset with source code.

Findings

01

Significant boost in detection performance after rectification.

02

Reduction in required training images for effective detection.

03

First public hazmat sign dataset with RGB-D sensors.

Abstract

In this paper, we focus on the question: how might mobile robots take advantage of affordable RGB-D sensors for object detection? Although current CNN-based object detectors have achieved impressive results, there are three main drawbacks for practical usage on mobile robots: 1) It is hard and time-consuming to collect and annotate large-scale training sets. 2) It usually needs a long training time. 3) CNN-based object detection shows significant weakness in predicting location. We propose an improved method for the detection of planar objects, which rectifies images with geometric information to compensate for the perspective distortion before feeding it to the CNN detector module, typically a CNN-based detector like YOLO or MASK RCNN. By dealing with the perspective distortion in advance, we eliminate the need for the CNN detector to learn that. Experiments show that this approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.