Determination of class-specific variables in nonparametric multiple-class classification
Wan-Ping Nicole Chen, Yuan-chin Ivan Chang

TL;DR
This paper introduces a probability-based nonparametric multi-class classification method that effectively identifies impactful variables for each class, balancing prediction accuracy with interpretability, especially useful in high-dimensional data scenarios.
Contribution
It proposes a novel nonparametric classification approach that combines high prediction power with the ability to identify class-specific influential variables.
Findings
Prediction power close to Bayes rule
Effective variable identification for each class
Applicable to high-dimensional data
Abstract
As technology advanced, collecting data via automatic collection devices become popular, thus we commonly face data sets with lengthy variables, especially when these data sets are collected without specific research goals beforehand. It has been pointed out in the literature that the difficulty of high-dimensional classification problems is intrinsically caused by too many noise variables useless for reducing classification error, which offer less benefits for decision-making, and increase complexity, and confusion in model-interpretation. A good variable selection strategy is therefore a must for using such kinds of data well; especially when we expect to use their results for the succeeding applications/studies, where the model-interpretation ability is essential. hus, the conventional classification measures, such as accuracy, sensitivity, precision, cannot be the only performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
