Sixth-Sense: Self-Supervised Learning of Spatial Awareness of Humans from a Planar Lidar
Simone Arreghini, Nicholas Carlotti, Mirko Nava, Antonio Paolillo, Alessandro Giusti

TL;DR
This paper introduces a self-supervised method enabling robots with 1D LiDARs to detect and estimate human positions and orientations omnidirectionally, enhancing safety and interaction in shared spaces.
Contribution
It presents a novel self-supervised learning approach that uses RGB-D camera detections to train 1D LiDAR-based human detection and pose estimation models without manual labeling.
Findings
Achieved 71% precision and 80% recall in human detection
Estimated distance with 13cm mean absolute error
Estimated orientation with 44° mean absolute error
Abstract
Reliable localization of people is fundamental for service and social robots that must operate in close interaction with humans. State-of-the-art human detectors often rely on RGB-D cameras or costly 3D LiDARs. However, most commercial robots are equipped with cameras with a narrow field of view, leaving them unaware of users approaching from other directions, or inexpensive 1D LiDARs whose readings are hard to interpret. To address these limitations, we propose a self-supervised approach to detect humans and estimate their 2D pose from 1D LiDAR data, using detections from an RGB-D camera as supervision. Trained on 70 minutes of autonomously collected data, our model detects humans omnidirectionally in unseen environments with 71% precision, 80% recall, and mean absolute errors of 13cm in distance and 44{\deg} in orientation, measured against ground truth data. Beyond raw detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
