TL;DR
This paper introduces a fast feature selection method for linear classification based on Orthogonal Least Squares, utilizing a new correlation measure, and demonstrates its efficiency and effectiveness over existing methods.
Contribution
The paper proposes a novel SOCC criterion for OLS-based feature selection, revealing its statistical significance and advantages over mutual information methods.
Findings
The proposed method ranks among the top 5 in performance across tested methods.
It operates efficiently with continuous features without discretization.
It outperforms or matches existing methods on synthetic and real datasets.
Abstract
An Orthogonal Least Squares (OLS) based feature selection method is proposed for both binomial and multinomial classification. The novel Squared Orthogonal Correlation Coefficient (SOCC) is defined based on Error Reduction Ratio (ERR) in OLS and used as the feature ranking criterion. The equivalence between the canonical correlation coefficient, Fisher's criterion, and the sum of the SOCCs is revealed, which unveils the statistical implication of ERR in OLS for the first time. It is also shown that the OLS based feature selection method has speed advantages when applied for greedy search. The proposed method is comprehensively compared with the mutual information based feature selection methods and the embedded methods using both synthetic and real world datasets. The results show that the proposed method is always in the top 5 among the 12 candidate methods. Besides, the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
