ATLAS: An Annotation Tool for Long-horizon Robotic Action Segmentation
Sergej Stanovcic, Daniel Sliwowski, Dongheui Lee

TL;DR
ATLAS is a versatile annotation tool designed for long-horizon robotic demonstrations, supporting multi-modal data visualization and efficient boundary annotation to improve action segmentation accuracy.
Contribution
It introduces a modular, multi-modal annotation platform that supports various dataset formats and enhances annotation efficiency and precision for robotic tasks.
Findings
Reduced annotation time by at least 6% compared to ELAN.
Improved temporal alignment with expert annotations by over 2.8%.
Decreased boundary error fivefold compared to vision-only tools.
Abstract
Annotating long-horizon robotic demonstrations with precise temporal action boundaries is crucial for training and evaluating action segmentation and manipulation policy learning methods. Existing annotation tools, however, are often limited: they are designed primarily for vision-only data, do not natively support synchronized visualization of robot-specific time-series signals (e.g., gripper state or force/torque), or require substantial effort to adapt to different dataset formats. In this paper, we introduce ATLAS, an annotation tool tailored for long-horizon robotic action segmentation. ATLAS provides time-synchronized visualization of multi-modal robotic data, including multi-view video and proprioceptive signals, and supports annotation of action boundaries, action labels, and task outcomes. The tool natively handles widely used robotics dataset formats such as ROS bags and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
