Scalable Prototype Selection by Genetic Algorithms and Hashing
Yenisel Plasencia-Cala\~na, Mauricio Orozco-Alzate, Heydi, M\'endez-V\'azquez, Edel Garc\'ia-Reyes, Robert P.W. Duin

TL;DR
This paper introduces scalable prototype selection methods for dissimilarity-based classification using genetic algorithms and hashing, enabling efficient handling of large datasets with competitive prototype quality.
Contribution
It proposes novel scalable prototype selection algorithms combining genetic algorithms, hashing, and new criteria for large dissimilarity datasets.
Findings
Methods effectively select high-quality prototypes from large datasets.
Algorithms operate with low computational runtimes.
Experimental results demonstrate improved scalability and prototype quality.
Abstract
Classification in the dissimilarity space has become a very active research area since it provides a possibility to learn from data given in the form of pairwise non-metric dissimilarities, which otherwise would be difficult to cope with. The selection of prototypes is a key step for the further creation of the space. However, despite previous efforts to find good prototypes, how to select the best representation set remains an open issue. In this paper we proposed scalable methods to select the set of prototypes out of very large datasets. The methods are based on genetic algorithms, dissimilarity-based hashing, and two different unsupervised and supervised scalable criteria. The unsupervised criterion is based on the Minimum Spanning Tree of the graph created by the prototypes as nodes and the dissimilarities as edges. The supervised criterion is based on counting matching labels of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Data Classification · Face and Expression Recognition
