Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning
Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm

TL;DR
This paper establishes theoretical limits of unsupervised domain adaptation (UDA), introduces data poisoning attacks to exploit these limits, and demonstrates significant accuracy drops in UDA methods under attack, highlighting their robustness issues.
Contribution
The paper provides a lower bound on target error in UDA, proposes data poisoning attacks to challenge UDA methods, and empirically shows their vulnerability to such attacks.
Findings
Lower bound on target domain error established
Poisoning attacks can reduce UDA accuracy to near zero
UDA methods are vulnerable to data poisoning in benchmark datasets
Abstract
Unsupervised domain adaptation (UDA) enables cross-domain learning without target domain labels by transferring knowledge from a labeled source domain whose distribution differs from that of the target. However, UDA is not always successful and several accounts of `negative transfer' have been reported in the literature. In this work, we prove a simple lower bound on the target domain error that complements the existing upper bound. Our bound shows the insufficiency of minimizing source domain error and marginal distribution mismatch for a guaranteed reduction in the target domain error, due to the possible increase of induced labeling function mismatch. This insufficiency is further illustrated through simple distributions for which the same UDA approach succeeds, fails, and may succeed or fail with an equal chance. Motivated from this, we propose novel data poisoning attacks to fool…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
