Analogical Relevance Index
Suryani Lim, Henri Prade, Gilles Richard

TL;DR
This paper introduces the Analogical Relevance Index (ARI), a new feature significance measure inspired by analogical proportions, which improves feature selection by detecting importance and redundancy in datasets for ML and data mining.
Contribution
The paper presents ARI, a novel filter-based statistical test for feature significance that can also identify feature redundancy, outperforming existing methods.
Findings
ARI effectively detects significant features.
ARI outperforms well-known feature selection methods.
ARI can identify feature redundancy.
Abstract
Focusing on the most significant features of a dataset is useful both in machine learning (ML) and data mining. In ML, it can lead to a higher accuracy, a faster learning process, and ultimately a simpler and more understandable model. In data mining, identifying significant features is essential not only for gaining a better understanding of the data but also for visualization. In this paper, we demonstrate a new way of identifying significant features inspired by analogical proportions. Such a proportion is of the form of "a is to b as c is to d", comparing two pairs of items (a, b) and (c, d) in terms of similarities and dissimilarities. In a classification context, if the similarities/dissimilarities between a and b correlate with the fact that a and b have different labels, this knowledge can be transferred to c and d, inferring that c and d also have different labels. From a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Face and Expression Recognition · Sensory Analysis and Statistical Methods
MethodsTest · Feature Selection
