Regret Bounds for Non-decomposable Metrics with Missing Labels

Prateek Jain; Nagarajan Natarajan

arXiv:1606.02077 · cs.LG · June 8, 2016 · 2 cites

TL;DR

This paper develops a framework for optimizing non-decomposable metrics like F1 in settings with missing labels, providing theoretical regret bounds and demonstrating improved empirical performance.

Contribution

It introduces a generic approach to handle missing labels in non-decomposable metric optimization with provable regret bounds across multiple learning settings.

Findings

01

Regret bounds are derived for collaborative filtering, multilabel classification, and PU learning.

02

The proposed method outperforms existing approaches that ignore missing label information.

03

Empirical results show significant improvements in F1 score on synthetic and benchmark datasets.

Abstract

We consider the problem of recommending relevant labels (items) for a given data point (user). In particular, we are interested in the practically important setting where the evaluation is with respect to performance metrics that are non-decomposable over labels, like the $F_{1}$ measure, and the training data has missing labels. To this end, we propose a generic framework that, given a performance metric $\Psi$, can devise a regularized objective function and a threshold such that all the values in the predicted score vector above, and only above, the threshold are selected to be positive. We show that the regret or generalization error in the given metric $\Psi$ is ultimately bounded by the estimation error of certain underlying parameters. In particular, we derive regret bounds under three popular settings: a) collaborative filtering, b) multilabel classification, and c) PU (positive-unlabeled)…
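The thresholding rule described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes the score vector is already given, it ignores missing labels, and the function names (`f1_from_predictions`, `predict_with_threshold`, `best_threshold`) and the toy data are hypothetical. It only shows how $F_{1}$ depends jointly on true positives, false positives, and false negatives (which is why it does not decompose into a sum over labels) and how a single threshold turns scores into positive predictions.

```python
import numpy as np


def f1_from_predictions(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when the denominator is 0."""
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom > 0 else 0.0


def predict_with_threshold(scores, threshold):
    """Mark as positive exactly those entries whose score exceeds the threshold."""
    return (scores > threshold).astype(int)


def best_threshold(scores, y_true, candidates):
    """Pick the candidate threshold with the highest F1 (illustrative grid search)."""
    return max(
        candidates,
        key=lambda t: f1_from_predictions(y_true, predict_with_threshold(scores, t)),
    )


# Toy data (hypothetical): five labels, their scores, and fully observed ground truth.
scores = np.array([0.9, 0.8, 0.4, 0.3, 0.1])
y_true = np.array([1, 1, 0, 1, 0])
candidates = [0.2, 0.35, 0.5]

chosen = best_threshold(scores, y_true, candidates)
chosen_f1 = f1_from_predictions(y_true, predict_with_threshold(scores, chosen))
```

On this toy input the lowest candidate threshold wins, because admitting one false positive recovers a true positive that the stricter thresholds miss. In the paper's setting the threshold comes from the proposed framework rather than from a search against fully observed labels, which are unavailable when labels are missing.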

Taxonomy

Topics: Recommender Systems and Techniques · Advanced Bandit Algorithms Research · Machine Learning and Data Classification