Propensity-scored Probabilistic Label Trees

Marek Wydmuch; Kalina Jasinska-Kobus; Rohit Babbar; Krzysztof; Dembczy\'nski

arXiv:2110.10803·cs.LG·October 22, 2021

Propensity-scored Probabilistic Label Trees

Marek Wydmuch, Kalina Jasinska-Kobus, Rohit Babbar, Krzysztof, Dembczy\'nski

PDF

1 Repo

TL;DR

This paper introduces an efficient inference method using $A^*$-search for optimal predictions in propensity-scored probabilistic label trees, improving XMLC performance on large, imbalanced label sets.

Contribution

It presents a novel $A^*$-search based inference procedure for probabilistic label trees under the propensity model, enhancing XMLC prediction accuracy.

Findings

01

Efficiently finds optimal label predictions in XMLC tasks.

02

Demonstrates improved performance on benchmark datasets.

03

Validates the approach's effectiveness in real-world scenarios.

Abstract

Extreme multi-label classification (XMLC) refers to the task of tagging instances with small subsets of relevant labels coming from an extremely large set of all possible labels. Recently, XMLC has been widely applied to diverse web applications such as automatic content labeling, online advertising, or recommendation systems. In such environments, label distribution is often highly imbalanced, consisting mostly of very rare tail labels, and relevant labels can be missing. As a remedy to these problems, the propensity model has been introduced and applied within several XMLC algorithms. In this work, we focus on the problem of optimal predictions under this model for probabilistic label trees, a popular approach for XMLC problems. We introduce an inference procedure, based on the $A^{*}$ -search algorithm, that efficiently finds the optimal solution, assuming that all probabilities and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mwydmuch/napkinXC
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.