Demonstration Sidetracks: Categorizing Systematic Non-Optimality in Human Demonstrations
Shijie Fang, Hang Yu, Qidi Fang, Reuben M. Aronson, Elaine S. Short

TL;DR
This paper identifies and categorizes systematic non-optimal behaviors, called demonstration sidetracks, in human demonstrations for robot learning, highlighting their patterns and implications for improving Learning from Demonstration methods.
Contribution
It introduces a novel taxonomy of demonstration sidetracks, providing detailed analysis of their types, distribution, and dependence on control interfaces in human demonstrations.
Findings
Sidetracks are frequent and context-dependent.
Four types of sidetracks identified: Exploration, Mistake, Alignment, Pause.
Control patterns vary with interface and influence demonstration quality.
Abstract
Learning from Demonstration (LfD) is a popular approach for robots to acquire new skills, but most LfD methods suffer from imperfections in human demonstrations. Prior work typically treats these suboptimalities as random noise. In this paper we study non-optimal behaviors in non-expert demonstrations and show that they are systematic, forming what we call demonstration sidetracks. Using a public space study with 40 participants performing a long-horizon robot task, we recreated the setup in simulation and annotated all demonstrations. We identify four types of sidetracks (Exploration, Mistake, Alignment, Pause) and one control pattern (one-dimension control). Sidetracks appear frequently across participants, and their temporal and spatial distribution is tied to task context. We also find that users' control patterns depend on the control interface. These insights point to the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making
