
TL;DR
This paper introduces the min-max kernel for nonnegative data, demonstrates its effectiveness in classification, and proposes a hashing-based linearization technique called the 0-bit scheme for scalable large-scale applications.
Contribution
It presents a novel linearization method for the min-max kernel using a simplified hashing scheme, enabling efficient large-scale classification with nonlinear similarity measures.
Findings
Min-max kernel improves similarity measurement for nonnegative data.
The 0-bit hashing scheme retains essential information for classification.
Empirical results show effective large-scale classification using the proposed method.
Abstract
The min-max kernel is a generalization of the popular resemblance kernel (which is designed for binary data). In this paper, we demonstrate, through an extensive classification study using kernel machines, that the min-max kernel often provides an effective measure of similarity for nonnegative data. As the min-max kernel is nonlinear and might be difficult to be used for industrial applications with massive data, we show that the min-max kernel can be linearized via hashing techniques. This allows practitioners to apply min-max kernel to large-scale applications using well matured linear algorithms such as linear SVM or logistic regression. The previous remarkable work on consistent weighted sampling (CWS) produces samples in the form of () where the records the location (and in fact also the weights) information analogous to the samples produced by classical minwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Machine Learning and Data Classification
MethodsSupport Vector Machine
