Transparent Single-Cell Set Classification with Kernel Mean Embeddings

Siyuan Shan; Vishal Baskaran; Haidong Yi; Jolene Ranek; Natalie Stanley; Junier Oliva

arXiv:2201.07322·cs.LG·June 29, 2022

TL;DR

This paper introduces a transparent method, based on kernel mean embeddings, for classifying single-cell samples. It achieves accuracy comparable to or better than state-of-the-art gating-free methods while being more interpretable and cheaper to compute than deep learning models.

Contribution

The paper presents a novel application of Kernel Mean Embedding for transparent classification of single-cell data, matching or surpassing state-of-the-art accuracy with simpler, interpretable models.

Findings

01 Achieves accuracy comparable to or better than state-of-the-art gating-free methods.

02 Uses a simpler model with fewer parameters than deep learning alternatives, which makes it easier to interpret.

03 Provides biological insights linking cellular heterogeneity to phenotypes.

Abstract

Modern single-cell flow and mass cytometry technologies measure the expression of several proteins of the individual cells within a blood or tissue sample. Each profiled biological sample is thus represented by a set of hundreds of thousands of multidimensional cell feature vectors, which incurs a high computational cost to predict each biological sample's associated phenotype with machine learning models. Such a large set cardinality also limits the interpretability of machine learning models due to the difficulty in tracking how each individual cell influences the ultimate prediction. We propose using Kernel Mean Embedding to encode the cellular landscape of each profiled biological sample. Although our foremost goal is to make a more transparent model, we find that our method achieves comparable or better accuracies than the state-of-the-art gating-free methods through a simple…
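The abstract is cut off before it describes the classifier, so the following is only a minimal sketch of the general idea of a kernel mean embedding, not the authors' implementation. It assumes an RBF kernel approximated with random Fourier features and an untrained linear scoring vector; the names `random_fourier_features`, `kernel_mean_embedding`, `gamma`, `omega`, `bias`, and `w`, as well as the synthetic data, are all hypothetical. It illustrates two points from the abstract: a sample with any number of cells is reduced to one fixed-length vector, and with a linear model the sample score is the average of per-cell scores, so each cell's influence can be tracked.

```python
import numpy as np

def random_fourier_features(cells, omega, bias):
    """Map each cell (row) to D random features approximating an RBF kernel."""
    d_out = omega.shape[1]
    return np.sqrt(2.0 / d_out) * np.cos(cells @ omega + bias)

def kernel_mean_embedding(cells, omega, bias):
    """Average the per-cell features: one fixed-length vector per sample."""
    return random_fourier_features(cells, omega, bias).mean(axis=0)

rng = np.random.default_rng(0)
n_markers, n_features, gamma = 5, 256, 0.5  # hypothetical settings

# For k(x, y) = exp(-gamma * ||x - y||^2), frequencies follow N(0, 2 * gamma * I).
omega = rng.normal(scale=np.sqrt(2.0 * gamma), size=(n_markers, n_features))
bias = rng.uniform(0.0, 2.0 * np.pi, size=n_features)

# Two synthetic "samples" with different cell counts (random stand-in data).
sample_a = rng.normal(size=(1000, n_markers))
sample_b = rng.normal(loc=0.5, size=(1500, n_markers))

emb_a = kernel_mean_embedding(sample_a, omega, bias)
emb_b = kernel_mean_embedding(sample_b, omega, bias)

# Hypothetical linear scoring vector (random here, not trained).
w = rng.normal(size=n_features)

# Per-cell scores; their mean equals the score of the sample embedding.
cell_scores = random_fourier_features(sample_a, omega, bias) @ w
sample_score = emb_a @ w
```

In this sketch both samples map to vectors of length `n_features` regardless of how many cells they contain, and sorting `cell_scores` would rank the cells of `sample_a` by how strongly they push the linear score up or down.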

Code & Models

Repositories

shansiliu95/ckme
Official