StackFeat RL: Reinforcement Learning over Iterative Dual Criterion Feature Selection for Stable Biomarker Discovery

A. Yermekov; D.A. Herrera-Mart\'i

arXiv:2604.22892·cs.LG·May 6, 2026

StackFeat RL: Reinforcement Learning over Iterative Dual Criterion Feature Selection for Stable Biomarker Discovery

A. Yermekov, D.A. Herrera-Mart\'i

PDF

TL;DR

StackFeat-RL is a reinforcement learning framework that optimizes iterative dual-criterion feature selection, leading to more accurate, sparse, and stable biomarker discovery in high-dimensional genomic data.

Contribution

It introduces a novel meta-learning approach using REINFORCE to optimize dual-criterion feature selection, improving stability and accuracy over existing methods.

Findings

01

Achieves highest predictive accuracy on COVID-19 and Alzheimer's datasets.

02

Requires 3-4 times fewer features than competing methods.

03

Provides convergence guarantees through iterative accumulation.

Abstract

Feature selection in high-dimensional genomic data ( $d ≫ n$ ) demands methods that are simultaneously accurate, sparse, and stable. Existing approaches either require manual threshold specification (mRMR, stability selection), produce unstable selections under data perturbation (Lasso, Boruta), or ignore biological structure entirely. We introduce StackFeat-RL, a meta-learning framework that optimises the hyperparameters of an iterative dual-criterion feature selection algorithm via REINFORCE policy gradients. The dual criterion, requiring both coefficient consistency and selection frequency, guards against two failure modes missed by single-criterion methods, while iterative accumulation provides convergence guarantees via the law of large numbers. On COVID-19 miRNA data (GSE240888, 332 features) and three Alzheimer's disease classification tasks (GSE84422, 13237 genes; Normal vs.\…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.