TL;DR
This paper introduces a lightweight, real-time structural detection framework using Bird's-Eye-View images from 3D LiDAR data, suitable for resource-constrained robots, with a focus on efficiency and robustness.
Contribution
The work presents a novel BEV-based perception pipeline that balances robustness and computational efficiency for real-time structural detection on low-power robotic platforms.
Findings
YOLO-OBB achieves real-time detection at 10 Hz on low-power hardware.
Classical methods are fast but sensitive to noise; RANSAC is robust but not real-time.
The proposed pipeline effectively filters cluttered observations without GPU acceleration.
Abstract
Efficient structural perception is essential for mapping and autonomous navigation on resource-constrained robots. Existing 3D methods are computationally prohibitive, while traditional 2D geometric approaches lack robustness. This paper presents a lightweight, real-time framework that projects 3D LiDAR data into 2D Bird's-Eye-View (BEV) images to enable efficient detection of structural elements relevant to mapping and navigation. Within this representation, we systematically evaluate several feature extraction strategies, including classical geometric techniques (Hough Transform, RANSAC, and LSD) and a deep learning detector based on YOLO-OBB. The resulting detections are integrated through a spatiotemporal fusion module that improves stability and robustness across consecutive frames. Experiments conducted on a standard mobile robotic platform highlight clear performance trade-offs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
