IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception
Haichuan Li, Changda Tian, Panos Trahanias, Tomi Westerlund

TL;DR
IndoorBEV introduces a mask-based Bird's-Eye View approach for indoor perception, enabling joint detection and footprint completion of static and dynamic objects, improving robustness in complex indoor scenes.
Contribution
The paper presents a novel mask-based BEV method with a specialized architecture for joint object detection and footprint prediction in indoor environments.
Findings
Effective handling of occlusions and varied object shapes.
Robust detection of static and dynamic objects.
Improved indoor scene understanding demonstrated.
Abstract
Detecting diverse objects within complex indoor 3D point clouds presents significant challenges for robotic perception, particularly with varied object shapes, clutter, and the co-existence of static and dynamic elements where traditional bounding box methods falter. To address these limitations, we propose IndoorBEV, a novel mask-based Bird's-Eye View (BEV) method for indoor mobile robots. In a BEV method, a 3D scene is projected into a 2D BEV grid which handles naturally occlusions and provides a consistent top-down view aiding to distinguish static obstacles from dynamic agents. The obtained 2D BEV results is directly usable to downstream robotic tasks like navigation, motion prediction, and planning. Our architecture utilizes an axis compact encoder and a window-based backbone to extract rich spatial features from this BEV map. A query-based decoder head then employs learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Automated Road and Building Extraction · Remote Sensing and LiDAR Applications
