Dataset Poisoning Attacks on Behavioral Cloning Policies

Akansha Kalra; Soumil Datta; Ethan Gilmore; Duc La; Guanhong Tao; Daniel S. Brown

arXiv:2511.20992·cs.LG·November 27, 2025

Dataset Poisoning Attacks on Behavioral Cloning Policies

Akansha Kalra, Soumil Datta, Ethan Gilmore, Duc La, Guanhong Tao, Daniel S. Brown

PDF

Open Access

TL;DR

This paper investigates the vulnerability of Behavior Cloning policies to clean-label backdoor attacks, revealing that even minimally poisoned datasets can produce deceptively high performance while being highly susceptible to trigger-based manipulations during deployment.

Contribution

It introduces the first analysis of backdoor attacks on BC policies, including a novel entropy-based trigger method and evaluates how attack effectiveness scales with poisoning parameters.

Findings

01

BC policies are highly vulnerable to backdoor triggers during deployment.

02

Minimal poisoning can cause significant performance degradation with trigger attacks.

03

Deceptively high baseline performance masks underlying vulnerability.

Abstract

Behavior Cloning (BC) is a popular framework for training sequential decision policies from expert demonstrations via supervised learning. As these policies are increasingly being deployed in the real world, their robustness and potential vulnerabilities are an important concern. In this work, we perform the first analysis of the efficacy of clean-label backdoor attacks on BC policies. Our backdoor attacks poison a dataset of demonstrations by injecting a visual trigger to create a spurious correlation that can be exploited at test time. We evaluate how policy vulnerability scales with the fraction of poisoned data, the strength of the trigger, and the trigger type. We also introduce a novel entropy-based test-time trigger attack that substantially degrades policy performance by identifying critical states where test-time triggering of the backdoor is expected to be most effective at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Reinforcement Learning in Robotics