Robust Similarity and Distance Learning via Decision Forests
Tyler M. Tomita, Joshua T. Vogelstein

TL;DR
This paper introduces SMERF, a decision forest-based algorithm that learns expressive, data-driven similarity and distance measures, outperforming traditional linear methods in capturing complex relationships.
Contribution
The paper presents a novel decision forest approach for distance learning that generalizes standard trees and improves expressiveness while maintaining interpretability.
Findings
SMERF effectively approximates arbitrary distances.
It identifies important features accurately.
It predicts network links with high accuracy.
Abstract
Canonical distances such as Euclidean distance often fail to capture the appropriate relationships between items, subsequently leading to subpar inference and prediction. Many algorithms have been proposed for automated learning of suitable distances, most of which employ linear methods to learn a global metric over the feature space. While such methods offer nice theoretical properties, interpretability, and computationally efficient means for implementing them, they are limited in expressive capacity. Methods which have been designed to improve expressiveness sacrifice one or more of the nice properties of the linear methods. To bridge this gap, we propose a highly expressive novel decision forest algorithm for the task of distance learning, which we call Similarity and Metric Random Forests (SMERF). We show that the tree construction procedure in SMERF is a proper generalization of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Face and Expression Recognition · Bayesian Modeling and Causal Inference
