BoxNet: A Deep Learning Method for 2D Bounding Box Estimation from Bird's-Eye View Point Cloud
Ehsan Nezhadarya, Yang Liu, Bingbing Liu

TL;DR
BoxNet is a deep learning approach that efficiently estimates 2D bounding boxes from bird's-eye view LiDAR point clouds, handling unordered data and angle discontinuities to improve accuracy over traditional methods.
Contribution
It introduces a neural network that predicts box pose and size directly from unordered BEV points, including a novel angle estimation technique and relative center prediction.
Findings
Significant accuracy improvement over non-learning methods on KITTI dataset.
Effective handling of unordered point clouds and angle discontinuities.
Robust size and pose estimation from sparse LiDAR data.
Abstract
We present a learning-based method to estimate the object bounding box from its 2D bird's-eye view (BEV) LiDAR points. Our method, entitled BoxNet, exploits a simple deep neural network that can efficiently handle unordered points. The method takes as input the 2D coordinates of all the points and the output is a vector consisting of both the box pose (position and orientation in LiDAR coordinate system) and its size (width and length). In order to deal with the angle discontinuity problem, we propose to estimate the double-angle sinusoidal values rather than the angle itself. We also predict the center relative to the point cloud mean to boost the performance of estimating the location of the box. The proposed method does not rely on the ordering of points as in many existing approaches, and can accurately predict the actual size of the bounding box based on the prior information that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical measurement and interference techniques · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
