Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

John Bateman; Andy M. Tyrrell; Jihong Zhu

arXiv:2603.26467·cs.RO·March 30, 2026

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

John Bateman, Andy M. Tyrrell, Jihong Zhu

PDF

TL;DR

This paper introduces a negative-feedback imitation learning system that effectively leverages suboptimal demonstrations to improve robot task performance, especially in ambiguous scenarios, validated through simulations and real robots.

Contribution

It proposes a novel negative-feedback approach based on Product of Experts to address ambiguity and learn from failures in imitation learning, outperforming existing methods.

Findings

01

Achieved 90% success rate improvement over non-negative feedback systems.

02

Demonstrated higher efficacy, memory, and time efficiency than comparable schemes.

03

Validated results through both simulated and real-robot experiments.

Abstract

Programming robots to perform complex tasks is often difficult and time consuming, requiring expert knowledge and skills in robot software and sometimes hardware. Imitation learning is a method for training robots to perform tasks by leveraging human expertise through demonstrations. Typically, the assumption is that those demonstrations are performed by a single, highly competent expert. However, in many real-world applications that use user demonstrations for tasks or incorporate both user data and pretrained data, such as home robotics including assistive robots, this is unlikely to be the case. This paper presents research towards a system which can leverage suboptimal demonstrations to solve ambiguous tasks; and particularly learn from its own failures. This is a negative-feedback system which achieves significant improvement over purely positive imitation learning for ambiguous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.