Hierarchical novel class discovery for single-cell transcriptomic profiles

Malek Senoussi; Thierry Artières; Paul Villoutreix

arXiv:2409.05937 · q-bio.GN · September 11, 2024

TL;DR

This paper introduces clustering methods for single-cell transcriptomic data that exploit the hierarchical structure arising from cell differentiation, so that novel cell types can be discovered and annotated at the same time.

Contribution

It proposes extensions of k-Means and GMM clustering algorithms specifically designed for hierarchical single-cell data, addressing the novel class discovery challenge.

Findings

01 Effective clustering of artificial and experimental datasets

02 Improved annotation accuracy for novel cell types

03 Leveraging hierarchical structure enhances discovery

Abstract

One of the major challenges arising from single-cell transcriptomics experiments is how to annotate the resulting single-cell transcriptomic profiles. Because the data are large and high-dimensional, automated annotation methods are needed. We focus here on datasets obtained in the context of developmental biology, where the differentiation process leads to a hierarchical structure. We consider a frequent setting where both labeled and unlabeled data are available at training time, but the label set of the labeled data and that of the unlabeled data are disjoint. This is an instance of the Novel Class Discovery problem. The goal is twofold: clustering the data and mapping the clusters to labels. We propose extensions of k-Means and GMM clustering methods for solving the problem and report comparative…
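To make the setting concrete, the following is a minimal sketch of a hierarchical Novel Class Discovery task on toy data. It is not the k-Means or GMM extension proposed in the paper: it runs plain k-means on the unlabeled cells and then maps each cluster to a parent node using centroids computed from the labeled cells. The two-level hierarchy (`A`, `B` with children `A1`, `A2`, `B1`, `B2`), the 2-D "expression profiles", and all function names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical two-level hierarchy: parent lineage -> child cell types.
parent_center = {"A": np.array([0.0, 0.0]), "B": np.array([20.0, 20.0])}
child_info = {
    "A1": ("A", np.array([-2.0, 0.0])),  # known (labeled) class
    "A2": ("A", np.array([2.0, 0.0])),   # novel (unlabeled) class
    "B1": ("B", np.array([-2.0, 0.0])),  # known (labeled) class
    "B2": ("B", np.array([2.0, 0.0])),   # novel (unlabeled) class
}

def sample(child, n=50):
    """Draw n toy 2-D profiles around the child's position in its lineage."""
    parent, offset = child_info[child]
    return parent_center[parent] + offset + 0.5 * rng.standard_normal((n, 2))

def kmeans(X, k, n_iter=20):
    """Plain Lloyd's k-means with deterministic farthest-first initialisation."""
    centroids = [X[0]]
    for _ in range(1, k):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centroids], axis=0)
        centroids.append(X[d2.argmax()])
    centroids = np.array(centroids, dtype=float)
    for _ in range(n_iter):
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = d2.argmin(axis=1)
        for j in range(k):
            if np.any(assign == j):
                centroids[j] = X[assign == j].mean(axis=0)
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, d2.argmin(axis=1)

# Labeled set: known classes only. Unlabeled set: novel classes only.
# The two label sets are disjoint, as in the Novel Class Discovery setting.
X_lab = np.vstack([sample("A1"), sample("B1")])
parent_lab = np.array(["A"] * 50 + ["B"] * 50)
X_unl = np.vstack([sample("A2"), sample("B2")])
true_parent = np.array(["A"] * 50 + ["B"] * 50)  # used for evaluation only

# Objective 1: cluster the unlabeled cells.
centroids, assign = kmeans(X_unl, k=2)

# Objective 2: map each cluster to a node of the hierarchy. Here a cluster is
# attached to the parent whose labeled cells have the nearest mean profile.
proto = {p: X_lab[parent_lab == p].mean(axis=0) for p in ("A", "B")}
cluster_parent = {
    j: min(proto, key=lambda p: np.sum((centroids[j] - proto[p]) ** 2))
    for j in range(2)
}

pred_parent = np.array([cluster_parent[j] for j in assign])
accuracy = float((pred_parent == true_parent).mean())
print("parent-level accuracy on novel cells:", accuracy)
```

The sketch treats clustering and mapping as two separate steps; methods that use the hierarchy during clustering itself, as the abstract describes, would couple them.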

Code & Models

Repositories

MalekSnous/hNCD-scRNAseq
PyTorch · Official

Taxonomy

Topics: Single-cell and spatial transcriptomics · Gene expression and cancer classification

Methods: Focus