Information Forests

Zhao Yi; Stefano Soatto; Maneesh Dewan; Yiqiang Zhan

arXiv:1202.1523·cs.LG·March 20, 2015

Information Forests

Zhao Yi, Stefano Soatto, Maneesh Dewan, Yiqiang Zhan

PDF

TL;DR

Information Forests extend Random Forests by using an information divergence criterion for node splitting, aiming to partition data into highly informative subsets to improve classification confidence.

Contribution

The paper introduces a novel classification method that replaces entropy-based splits with divergence-based splits, enhancing the informativeness of data partitions.

Findings

01

Outperforms traditional Random Forests in classification confidence

02

Effectively partitions data into highly informative subsets

03

Relates to active and semi-supervised learning paradigms

Abstract

We describe Information Forests, an approach to classification that generalizes Random Forests by replacing the splitting criterion of non-leaf nodes from a discriminative one -- based on the entropy of the label distribution -- to a generative one -- based on maximizing the information divergence between the class-conditional distributions in the resulting partitions. The basic idea consists of deferring classification until a measure of "classification confidence" is sufficiently high, and instead breaking down the data so as to maximize this measure. In an alternative interpretation, Information Forests attempt to partition the data into subsets that are "as informative as possible" for the purpose of the task, which is to classify the data. Classification confidence, or informative content of the subsets, is quantified by the Information Divergence. Our approach relates to active…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.