See and Switch: Vision-Based Branching for Interactive Robot-Skill Programming
Petr Vanc, Jan Kristof Behrens, V\'aclav Hlav\'a\v{c}, Karla Stepanova

TL;DR
See & Switch introduces a vision-based framework for interactive robot skill programming that enables reliable, high-dimensional perception-grounded branching during task execution, improving scalability and robustness in real-world scenarios.
Contribution
The paper presents a novel vision-based branch selection method using eye-in-hand images, enabling interactive, modality-independent robot skill programming with anomaly detection.
Findings
Achieved 90.7% accuracy in branch selection
Achieved 87.9% accuracy in anomaly detection
Validated on three dexterous manipulation tasks
Abstract
Programming robots by demonstration (PbD) is an intuitive concept, but scaling it to real-world variability remains a challenge for most current teaching frameworks. Conditional task graphs are very expressive and can be defined incrementally, which fits very well with the PbD idea. However, acting using conditional task graphs requires reliable perception-grounded online branch selection. In this paper, we present See & Switch, an interactive teaching-and-execution framework that represents tasks as user-extendable graphs of skill parts connected via decision states (DS), enabling conditional branching during replay. Unlike prior approaches that rely on manual branching or low-dimensional signals (e.g., proprioception), our vision-based Switcher uses eye-in-hand images (high-dimensional) to select among competing successor skill parts and to detect out-of-distribution contexts that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Teaching and Learning Programming
