DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning Models
Vivian Lin, Kuk Jin Jang, Souradeep Dutta, Michele Caprio, Oleg, Sokolsky, Insup Lee

TL;DR
This paper introduces a control-based method using reinforcement learning to adaptively sanitize images and recover from distribution shifts in deep learning models, significantly improving robustness on benchmarks like ImageNet-C and CIFAR-100-C.
Contribution
It formulates distribution shift recovery as a Markov decision process and employs online data sanitization with theoretical guarantees, advancing robustness techniques.
Findings
Up to 14.21% accuracy improvement on ImageNet-C
Effective generalization to composite shifts
Robustness gains on CIFAR-100-C
Abstract
Deep neural networks have repeatedly been shown to be non-robust to the uncertainties of the real world, even to naturally occurring ones. A vast majority of current approaches have focused on data-augmentation methods to expand the range of perturbations that the classifier is exposed to while training. A relatively unexplored avenue that is equally promising involves sanitizing an image as a preprocessing step, depending on the nature of perturbation. In this paper, we propose to use control for learned models to recover from distribution shifts online. Specifically, our method applies a sequence of semantic-preserving transformations to bring the shifted data closer in distribution to the training set, as measured by the Wasserstein distance. Our approach is to 1) formulate the problem of distribution shift recovery as a Markov decision process, which we solve using reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
