DeconDTN-Toolkit: A Library for Evaluation and Enhancement of Robustness to Provenance Shift

Yongsen Tan; Zhecheng Sheng; Xiruo Ding; Serguei V. S. Pakhomov; Trevor Cohen

arXiv:2605.11237·cs.LG·May 13, 2026

DeconDTN-Toolkit: A Library for Evaluation and Enhancement of Robustness to Provenance Shift

Yongsen Tan, Zhecheng Sheng, Xiruo Ding, Serguei V. S. Pakhomov, Trevor Cohen

PDF

TL;DR

This paper introduces DeconDTN-Toolkit, a comprehensive library for evaluating and improving model robustness against provenance shifts, which are changes in data source relationships affecting deployment performance.

Contribution

It establishes a formal connection between provenance shift and invariant learning, and provides a toolkit for simulation, evaluation, and mitigation of provenance-related distribution shifts.

Findings

01

Empirical Risk Minimization is vulnerable under provenance shift.

02

The toolkit enables simulation of provenance shifts of varying degrees.

03

A new robustness indicator improves out-of-distribution performance assessment.

Abstract

Despite the burgeoning body of work on distribution shifts, provenance shift-where the relationship between data source and label changes at deployment-remains poorly understood and under-addressed. In this paper, we establish a formal connection between provenance shift, counterfactual invariance, and invariant learning to derive a learning objective for robustness. We then introduce \textsc{DeconDTN-Toolkit}, a specialized evaluation and remediation suite designed to simulate provenance shifts of varying degrees while maintaining the training protocol and the infrastructure of existing benchmarks. We reveal the vulnerability of Empirical Risk Minimization under provenance shift, introduce a robust out-of-distribution performance indicator, and conduct a comprehensive evaluation on existing algorithms. Our work provides both the theoretical grounding and the practical tools necessary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.