Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping
William Liang, Sam Wang, Hung-Ju Wang, Osbert Bastani, Yecheng Jason Ma, Dinesh Jayaraman

TL;DR
Tether enables autonomous, multi-task robotic play by warping demonstration actions through semantic keypoints, allowing continuous learning and data collection in real-world household environments with minimal human input.
Contribution
Introduces a novel semantic keypoint-based action warping policy and an autonomous play cycle that improves robot learning from limited demonstrations.
Findings
Robust policy warping from few demonstrations
Autonomous multi-task play over hours in real-world settings
Generated over 1000 expert-level trajectories
Abstract
The ability to conduct and learn from interaction and experience is a central challenge in robotics, offering a scalable alternative to labor-intensive human demonstrations. However, realizing such "play" requires (1) a policy robust to diverse, potentially out-of-distribution environment states, and (2) a procedure that continuously produces useful robot experience. To address these challenges, we introduce Tether, a method for autonomous functional play involving structured, task-directed interactions. First, we design a novel open-loop policy that warps actions from a small set of source demonstrations (<=10) by anchoring them to semantic keypoint correspondences in the target scene. We show that this design is extremely data-efficient and robust even under significant spatial and semantic variations. Second, we deploy this policy for autonomous functional play in the real world via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Social Robot Interaction and HRI
