RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems
Jaro Meyer, Fr\'ed\'eric Giraud, Joschua W\"uthrich, Marc Pollefeys, Philipp F\"urnstahl, Lilian Calvet

TL;DR
RocSync introduces a low-cost, versatile method for millisecond-accurate synchronization of diverse camera systems using a custom LED clock, enhancing multi-view applications in uncontrolled environments.
Contribution
It presents a novel LED-based visual encoding technique for precise synchronization across heterogeneous cameras without specialized hardware.
Findings
Achieves 1.34 ms residual synchronization error.
Outperforms existing light-, audio-, and timecode-based methods.
Improves downstream tasks like pose estimation and 3D reconstruction.
Abstract
Accurate spatiotemporal alignment of multi-view video streams is essential for a wide range of dynamic-scene applications such as multi-view 3D reconstruction, pose estimation, and scene understanding. However, synchronizing multiple cameras remains a significant challenge, especially in heterogeneous setups combining professional and consumer-grade devices, visible and infrared sensors, or systems with and without audio, where common hardware synchronization capabilities are often unavailable. This limitation is particularly evident in real-world environments, where controlled capture conditions are not feasible. In this work, we present a low-cost, general-purpose synchronization method that achieves millisecond-level temporal alignment across diverse camera systems while supporting both visible (RGB) and infrared (IR) modalities. The proposed solution employs a custom-built…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Network Time Synchronization Technologies · Advanced Vision and Imaging
