Population structure-learned classifier for high-dimension low-sample-size class-imbalanced problem
Liran Shen, Meng Joo Er, Qingbo Yin

TL;DR
This paper introduces the Population Structure-learned Classifier (PSC), a novel linear classifier designed for high-dimensional, low-sample-size, and class-imbalanced data, demonstrating superior performance over existing methods.
Contribution
The paper proposes a new classifier that maximizes inter- and intra-class scatter, adapts intercepts for each class, and efficiently handles high-dimensional, imbalanced datasets.
Findings
PSC outperforms state-of-the-art methods on benchmark datasets.
It effectively handles high-dimensional, low-sample-size, and imbalanced data.
The method maintains computational complexity similar to SVM.
Abstract
The Classification on high-dimension low-sample-size data (HDLSS) is a challenging problem and it is common to have class-imbalanced data in most application fields. We term this as Imbalanced HDLSS (IHDLSS). Recent theoretical results reveal that the classification criterion and tolerance similarity are crucial to HDLSS, which emphasizes the maximization of within-class variance on the premise of class separability. Based on this idea, a novel linear binary classifier, termed Population Structure-learned Classifier (PSC), is proposed. The proposed PSC can obtain better generalization performance on IHDLSS by maximizing the sum of inter-class scatter matrix and intra-class scatter matrix on the premise of class separability and assigning different intercept values to majority and minority classes. The salient features of the proposed approach are: (1) It works well on IHDLSS; (2) The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine
