Regret Bounds for Non-decomposable Metrics with Missing Labels
Prateek Jain, Nagarajan Natarajan

TL;DR
This paper develops a framework for optimizing non-decomposable metrics like F1 in settings with missing labels, providing theoretical regret bounds and demonstrating improved empirical performance.
Contribution
It introduces a generic approach to handle missing labels in non-decomposable metric optimization with provable regret bounds across multiple learning settings.
Findings
Regret bounds are derived for collaborative filtering, multilabel classification, and PU learning.
The proposed method outperforms existing approaches that ignore missing label information.
Empirical results show significant improvements in F1 score on synthetic and benchmark datasets.
Abstract
We consider the problem of recommending relevant labels (items) for a given data point (user). In particular, we are interested in the practically important setting where the evaluation is with respect to non-decomposable (over labels) performance metrics like the measure, and the training data has missing labels. To this end, we propose a generic framework that given a performance metric , can devise a regularized objective function and a threshold such that all the values in the predicted score vector above and only above the threshold are selected to be positive. We show that the regret or generalization error in the given metric is bounded ultimately by estimation error of certain underlying parameters. In particular, we derive regret bounds under three popular settings: a) collaborative filtering, b) multilabel classification, and c) PU (positive-unlabeled)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Machine Learning and Data Classification
