Supervised Feature Selection via Dependence Estimation
Le Song, Alex Smola, Arthur Gretton, Karsten Borgwardt, Justin Bedo

TL;DR
This paper presents a new supervised feature selection method that uses the Hilbert-Schmidt Independence Criterion to measure dependence between features and labels, unifying various learning tasks.
Contribution
It introduces a dependence-based feature filtering framework applicable to classification and regression, with an efficient backward-elimination algorithm for approximation.
Findings
Effective on artificial datasets
Demonstrates usefulness on real-world datasets
Unifies feature selection across supervised tasks
Abstract
We introduce a framework for filtering features that employs the Hilbert-Schmidt Independence Criterion (HSIC) as a measure of dependence between the features and the labels. The key idea is that good features should maximise such dependence. Feature selection for various supervised learning problems (including classification and regression) is unified under this framework, and the solutions can be approximated using a backward-elimination algorithm. We demonstrate the usefulness of our method on both artificial and real world datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and Data Classification
