Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis
Yedi Luo, Xiangyu Bai, Le Jiang, Aniket Gupta, Eric Mortin, Hanumant, Singh, Sarah Ostadabbas

TL;DR
This paper introduces TeFS, a novel method for generating high-quality synthetic stereo driving data from GTA V, enabling improved evaluation and development of vSLAM models for autonomous driving.
Contribution
The paper presents TeFS, a new technique for creating large-scale, high-fidelity stereo datasets from commercial game engines, filling a gap in available training data for vSLAM research.
Findings
GTAV-TeFS dataset contains over 88,000 stereo image pairs with rich metadata.
TeFS-generated data is validated against conventional dual-viewport data.
Benchmarking reveals strengths and limitations of current vSLAM models.
Abstract
This paper presents a novel approach, TeFS (Temporal-controlled Frame Swap), to generate synthetic stereo driving data for visual simultaneous localization and mapping (vSLAM) tasks. TeFS is designed to overcome the lack of native stereo vision support in commercial driving simulators, and we demonstrate its effectiveness using Grand Theft Auto V (GTA V), a high-budget open-world video game engine. We introduce GTAV-TeFS, the first large-scale GTA V stereo-driving dataset, containing over 88,000 high-resolution stereo RGB image pairs, along with temporal information, GPS coordinates, camera poses, and full-resolution dense depth maps. GTAV-TeFS offers several advantages over other synthetic stereo datasets and enables the evaluation and enhancement of state-of-the-art stereo vSLAM models under GTA V's environment. We validate the quality of the stereo data collected using TeFS by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications
