Online Importance Weight Aware Updates
Nikos Karampatziakis, John Langford

TL;DR
This paper introduces a novel importance weight-aware update method for online learning that improves robustness and efficiency, especially in active learning scenarios with large importance weights and adversarial noise.
Contribution
It develops an invariance-preserving update approach for importance weights, providing theoretical guarantees and practical improvements over standard gradient multiplication methods.
Findings
Superior prediction accuracy with similar computational cost
Reduced sensitivity to learning rate settings
Effective online active learning in noisy, adversarial environments
Abstract
An importance weight quantifies the relative importance of one example over another, coming up in applications of boosting, asymmetric classification costs, reductions, and active learning. The standard approach for dealing with importance weights in gradient descent is via multiplication of the gradient. We first demonstrate the problems of this approach when importance weights are large, and argue in favor of more sophisticated ways for dealing with them. We then develop an approach which enjoys an invariance property: that updating twice with importance weight is equivalent to updating once with importance weight . For many important losses this has a closed form update which satisfies standard regret guarantees when all examples have . We also briefly discuss two other reasonable approaches for handling large importance weights. Empirically, these approaches yield…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning
