TL;DR
COBALT is a scalable, cloud-based teleoperation platform enabling large-scale robot learning through crowdsourced demonstrations via smartphones and other common devices, supporting high concurrency and data quality.
Contribution
The paper introduces COBALT, a novel teleoperation system that democratizes robot learning at scale, supporting multiple users and devices with efficient infrastructure and data filtering.
Findings
Supports 8 concurrent users per GPU with 20 Hz control at sub-100 ms latency.
Demonstrates stable operation with 256 simulated clients across 8 GPUs.
Crowdsourced 7500+ demonstrations over five days, enabling high-quality dataset collection.
Abstract
The scarcity of large-scale, high-quality demonstration data remains a bottleneck in scaling imitation learning for robotic manipulation. We present COBALT, a teleoperation platform designed to democratize robot learning at scale both in simulation and in the real world. By leveraging vectorized environments, our scalable, load-balanced infrastructure supports concurrent teleoperation by multiple users on a single GPU, yielding a significant reduction in teleoperation cost. Operators can connect from nearly anywhere on Earth using commonly available devices, including single or dual smartphones, VR headsets, 3D mice, and keyboards. An inmemory data cache and efficient video streaming keep control and rendering synchronous, sustaining dozens of concurrent users at 20 Hz with sub-100 ms end-to-end latency for up to 8 concurrent users per GPU. We also demonstrate stable operation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
