Flexible High-dimensional Classification Machines and Their Asymptotic Properties
Xingye Qiao (1), Lingsong Zhang (2) ((1) State University of New York, at Binghamton, (2) Purdue University)

TL;DR
This paper introduces the FLAME family of classifiers, unifying SVM and DWD, to address high-dimensional overfitting and class imbalance issues, with theoretical analysis and practical demonstrations.
Contribution
The paper proposes FLAME, a unified classification framework that improves upon SVM and DWD by balancing overfitting and imbalance sensitivity, with asymptotic properties analyzed.
Findings
FLAME unifies SVM and DWD, revealing their intrinsic connection.
FLAME improves classification performance in high-dimensional, imbalanced data.
Theoretical properties of FLAME are established and validated through simulations and real data.
Abstract
Classification is an important topic in statistics and machine learning with great potential in many real applications. In this paper, we investigate two popular large margin classification methods, Support Vector Machine (SVM) and Distance Weighted Discrimination (DWD), under two contexts: the high-dimensional, low-sample size data and the imbalanced data. A unified family of classification machines, the FLexible Assortment MachinE (FLAME) is proposed, within which DWD and SVM are special cases. The FLAME family helps to identify the similarities and differences between SVM and DWD. It is well known that many classifiers overfit the data in the high-dimensional setting; and others are sensitive to the imbalanced data, that is, the class with a larger sample size overly influences the classifier and pushes the decision boundary towards the minority class. SVM is resistant to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Face and Expression Recognition · Text and Document Classification Technologies
MethodsSupport Vector Machine
