Similarity-based Learning via Data Driven Embeddings
Purushottam Kar, Prateek Jain

TL;DR
This paper introduces a flexible, data-driven framework for similarity-based classification that learns the best similarity functions and landmark points, leading to improved performance across various datasets.
Contribution
It unifies previous frameworks, allows data-driven learning of similarity goodness, and introduces a diversity heuristic for landmark selection, enhancing classification accuracy.
Findings
Outperforms existing methods on multiple datasets
Successfully learns the most suitable similarity functions for tasks
Effective diversity-based landmark selection improves results
Abstract
We consider the problem of classification using similarity/distance functions over data. Specifically, we propose a framework for defining the goodness of a (dis)similarity function with respect to a given learning task and propose algorithms that have guaranteed generalization properties when working with such good functions. Our framework unifies and generalizes the frameworks proposed by [Balcan-Blum ICML 2006] and [Wang et al ICML 2007]. An attractive feature of our framework is its adaptability to data - we do not promote a fixed notion of goodness but rather let data dictate it. We show, by giving theoretical guarantees that the goodness criterion best suited to a problem can itself be learned which makes our approach applicable to a variety of domains and problems. We propose a landmarking-based approach to obtaining a classifier from such learned goodness criteria. We then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
