Distributionally-Constrained Adversaries in Online Learning
Mo\"ise Blanchard, Samory Kpotufe

TL;DR
This paper explores a flexible framework for online learning where adversaries choose data distributions within certain constraints, providing new insights into when and how learning is possible across various distributional settings.
Contribution
It characterizes which distribution classes are learnable against different adversaries and shows that learners can adapt to multiple distribution constraints without prior knowledge.
Findings
Characterizes learnability for various distribution classes.
Generalizes known results from smoothed analysis.
Learners can adapt to multiple distribution constraints simultaneously.
Abstract
There has been much recent interest in understanding the continuum from adversarial to stochastic settings in online learning, with various frameworks including smoothed settings proposed to bridge this gap. We consider the more general and flexible framework of distributionally constrained adversaries in which instances are drawn from distributions chosen by an adversary within some constrained distribution class [RST11]. Compared to smoothed analysis, we consider general distributional classes which allows for a fine-grained understanding of learning settings between fully stochastic and fully adversarial for which a learner can achieve non-trivial regret. We give a characterization for which distribution classes are learnable in this context against both oblivious and adaptive adversaries, providing insights into the types of interplay between the function class and distributional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Bandit Algorithms Research · Machine Learning and Algorithms
