Similarity Learning for Provably Accurate Sparse Linear Classification
Aurelien Bellet (University of Saint-Etienne), Amaury Habrard, (University of Saint-Etienne), Marc Sebban (University of Saint-Etienne)

TL;DR
This paper introduces a novel algorithm for learning a non-PSD linear similarity in a nonlinear feature space, with proven stability and generalization bounds, leading to sparse and effective classifiers.
Contribution
It establishes a theoretical link between learned similarities and classification performance, and proposes a fast, robust method for sparse linear classification.
Findings
The approach is faster than existing methods.
It produces classifiers that are robust to overfitting.
The classifiers are very sparse and effective.
Abstract
In recent years, the crucial importance of metrics in machine learning algorithms has led to an increasing interest for optimizing distance and similarity functions. Most of the state of the art focus on learning Mahalanobis distances (requiring to fulfill a constraint of positive semi-definiteness) for use in a local k-NN algorithm. However, no theoretical link is established between the learned metrics and their performance in classification. In this paper, we make use of the formal framework of good similarities introduced by Balcan et al. to design an algorithm for learning a non PSD linear similarity optimized in a nonlinear feature space, which is then used to build a global linear classifier. We show that our approach has uniform stability and derive a generalization bound on the classification error. Experiments performed on various datasets confirm the effectiveness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methodsk-Nearest Neighbors
