# scFSNN: a feature selection method based on neural network for single-cell RNA-seq data

**Authors:** Minjiao Peng, Baoqin Lin, Jun Zhang, Yan Zhou, Bingqing Lin

PMC · DOI: 10.1186/s12864-024-10160-1 · BMC Genomics · 2024-03-08

## TL;DR

This paper introduces scFSNN, a neural network-based method for selecting important genes in single-cell RNA sequencing data.

## Contribution

The novelty lies in using a neural network to automatically select features while controlling false discovery rates and adapting to data complexity.

## Key findings

- scFSNN outperforms existing methods in feature selection for scRNA-seq data.
- The method adapts to data characteristics like over-dispersion and zero-inflation.
- Simulation and real data studies confirm its strong predictive performance.

## Abstract

While single-cell RNA sequencing (scRNA-seq) allows researchers to analyze gene expression in individual cells, its unique characteristics like over-dispersion, zero-inflation, high gene-gene correlation, and large data volume with many features pose challenges for most existing feature selection methods. In this paper, we present a feature selection method based on neural network (scFSNN) to solve classification problem for the scRNA-seq data. scFSNN is an embedded method that can automatically select features (genes) during model training, control the false discovery rate of selected features and adaptively determine the number of features to be eliminated. Extensive simulation and real data studies demonstrate its excellent feature selection ability and predictive performance.

The online version contains supplementary material available at 10.1186/s12864-024-10160-1.

## Full-text entities

- **Genes:** REG4 (regenerating family member 4) [NCBI Gene 83998] {aka GISP, REG-IV, RELP}, Lgr5 (leucine rich repeat containing G protein coupled receptor 5) [NCBI Gene 14160] {aka FEX, Gpr49}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, SYT1 (synaptotagmin 1) [NCBI Gene 6857] {aka BAGOS, P65, SVP65, SYT}, SNAP25 (synaptosome associated protein 25) [NCBI Gene 6616] {aka CMS18, DEE117, RIC-4, RIC4, SEC9, SNAP}
- **Diseases:** COVID-19 (MESH:D000086382), AD (MESH:D000544), dementia (MESH:D003704)
- **Chemicals:** SZU (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10924397/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10924397/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC10924397/full.md

---
Source: https://tomesphere.com/paper/PMC10924397