Conditional Probability Tree Estimation Analysis and Algorithms

Alina Beygelzimer; John Langford; Yuri Lifshits; Gregory Sorkin,; Alexander L. Strehl

arXiv:1408.2031·cs.LG·August 12, 2014·58 cites

Conditional Probability Tree Estimation Analysis and Algorithms

Alina Beygelzimer, John Langford, Yuri Lifshits, Gregory Sorkin,, Alexander L. Strehl

PDF

Open Access

TL;DR

This paper introduces a new online algorithm for efficiently estimating conditional probabilities over large label sets by constructing a logarithmic depth tree, with theoretical guarantees and empirical validation on a dataset with over a million labels.

Contribution

It presents the first online algorithm that adaptively builds a logarithmic depth tree for label probability estimation, with proven regret bounds and practical effectiveness.

Findings

01

Algorithm constructs a logarithmic depth tree in practice.

02

The method achieves low regret bounds theoretically.

03

Empirical tests show success on datasets with over 10^6 labels.

Abstract

We consider the problem of estimating the conditional probability of a label in time O(log n), where n is the number of possible labels. We analyze a natural reduction of this problem to a set of binary regression problems organized in a tree structure, proving a regret bound that scales with the depth of the tree. Motivated by this analysis, we propose the first online algorithm which provably constructs a logarithmic depth tree on the set of labels to solve this problem. We test the algorithm empirically, showing that it works succesfully on a dataset with roughly 106 labels.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques