PARTNR: Pick and place Ambiguity Resolving by Trustworthy iNteractive leaRning
Jelle Luijkx, Zlatan Ajanovic, Laura Ferranti, Jens Kober

TL;DR
The paper introduces PARTNR, an algorithm that detects ambiguities in robot policies using multimodal analysis and adaptive thresholds, reducing demonstrations and improving trustworthiness in pick-and-place tasks.
Contribution
PARTNR extends existing methods by incorporating ambiguity detection and adaptive learning to enhance policy robustness and reduce demonstration needs.
Findings
PARTNR effectively detects ambiguities in pick-and-place tasks.
The adaptive threshold reduces the number of demonstrations needed.
System increases trustworthiness by autonomously handling ambiguities.
Abstract
Several recent works show impressive results in mapping language-based human commands and image scene observations to direct robot executable policies (e.g., pick and place poses). However, these approaches do not consider the uncertainty of the trained policy and simply always execute actions suggested by the current policy as the most probable ones. This makes them vulnerable to domain shift and inefficient in the number of required demonstrations. We extend previous works and present the PARTNR algorithm that can detect ambiguities in the trained policy by analyzing multiple modalities in the pick and place poses using topological analysis. PARTNR employs an adaptive, sensitivity-based, gating function that decides if additional user demonstrations are required. User demonstrations are aggregated to the dataset and used for subsequent training. In this way, the policy can adapt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Robot Manipulation and Learning
