InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation
Marziyeh Bamdad, Hans-Peter Hutter, Alireza Darvishy

TL;DR
InCrowd-VI is a comprehensive visual-inertial dataset capturing realistic indoor pedestrian scenarios, revealing current SLAM limitations and aiding development for visually impaired navigation.
Contribution
The paper introduces InCrowd-VI, a new dataset with challenging indoor pedestrian scenarios for evaluating and improving SLAM algorithms.
Findings
State-of-the-art SLAM algorithms struggle with accuracy in crowded environments.
Deep learning methods have high coverage but lack real-time performance.
Classical methods show significant drift under challenging conditions.
Abstract
Simultaneous localization and mapping (SLAM) techniques can be used to navigate the visually impaired, but the development of robust SLAM solutions for crowded spaces is limited by the lack of realistic datasets. To address this, we introduce InCrowd-VI, a novel visual-inertial dataset specifically designed for human navigation in indoor pedestrian-rich environments. Recorded using Meta Aria Project glasses, it captures realistic scenarios without environmental control. InCrowd-VI features 58 sequences totaling a 5 km trajectory length and 1.5 hours of recording time, including RGB, stereo images, and IMU measurements. The dataset captures important challenges such as pedestrian occlusions, varying crowd densities, complex layouts, and lighting changes. Ground-truth trajectories, accurate to approximately 2 cm, are provided in the dataset, originating from the Meta Aria project machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Automated Road and Building Extraction
Methodstravel james · Adaptive Richard's Curve Weighted Activation
