Real-Time Operator Takeover for Visuomotor Diffusion Policy Training

Marco Moletta; Michael C. Welle; Nils Ingelhag; Jesper Munkeby; Danica Kragic

arXiv:2502.02308·cs.RO·April 1, 2026

Real-Time Operator Takeover for Visuomotor Diffusion Policy Training

Marco Moletta, Michael C. Welle, Nils Ingelhag, Jesper Munkeby, Danica Kragic

PDF

1 Repo

TL;DR

This paper introduces a real-time operator takeover framework for visuomotor diffusion policies, allowing seamless control intervention to improve policy robustness and performance across diverse object manipulation tasks.

Contribution

The authors propose a novel real-time takeover paradigm that enhances visuomotor policy training with targeted demonstrations and automatic out-of-distribution state detection.

Findings

01

Targeted takeover demonstrations significantly improve policy performance.

02

The Mahalanobis distance effectively identifies undesirable states during execution.

03

The framework is validated on tasks involving rigid, deformable, and granular objects.

Abstract

We present a Real-Time Operator Takeover (RTOT) paradigm that enables operators to seamlessly take control of a live visuomotor diffusion policy, guiding the system back to desirable states or providing targeted corrective demonstrations. Within this framework, the operator can intervene to correct the robot's motion, after which control is smoothly returned to the policy until further intervention is needed. We evaluate the takeover framework on three tasks spanning rigid, deformable, and granular objects, and show that incorporating targeted takeover demonstrations significantly improves policy performance compared with training on an equivalent number of initial demonstrations alone. Additionally, we provide an in-depth analysis of the Mahalanobis distance as a signal for automatically identifying undesirable or out-of-distribution states during execution. Supporting materials,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://operator-takeover.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.