Data as Voters: Core Set Selection Using Approval-Based Multi-Winner Voting

Luis S\'anchez-Fern\'andez; Jes\'us A. Fisteus; Rafael L\'opez-Zaragoza

arXiv:2304.09995·cs.LG·December 15, 2025·1 cites

Data as Voters: Core Set Selection Using Approval-Based Multi-Winner Voting

Luis S\'anchez-Fern\'andez, Jes\'us A. Fisteus, Rafael L\'opez-Zaragoza

PDF

Open Access

TL;DR

This paper introduces a novel data selection method for machine learning based on approval voting, where instances serve as both voters and candidates, leading to improved classifier performance.

Contribution

It proposes a new core set selection approach using approval-based multi-winner voting, integrating voting theory into data reduction for machine learning.

Findings

01

Improves classifier performance in several cases

02

Statistically significant improvements over state-of-the-art methods

03

Applicable to neural networks, KNN, and SVM classifiers

Abstract

We present a novel approach to the core set/instance selection problem in machine learning. Our approach is based on recent results on (proportional) representation in approval-based multi-winner elections. In our model, instances play a double role as voters and candidates. The approval set of each instance in the training set (acting as a voter) is defined from the concept of local set, which already exists in the literature. We then select the election winners by using a representative voting rule, and such winners are the data instances kept in the reduced training set. We evaluate our approach in two experiments involving neural network classifiers and classic machine learning classifiers (KNN and SVM). Our experiments show that, in several cases, our approach improves the performance of state-of-the-art methods, and the differences are statistically significant.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Voting Systems · Rough Sets and Fuzzy Logic · Internet Traffic Analysis and Secure E-voting