# Functional protein biomarkers based on distributions of expression levels in single-cell imaging data

**Authors:** Misung Yi, Tingting Zhan, Hallgeir Rui, Inna Chervoneva

PMC · DOI: 10.1093/bioinformatics/btaf182 · Bioinformatics · 2025-04-21

## TL;DR

This paper introduces a new method for analyzing protein expression in cancer cells using quantile functions to improve biomarker prediction.

## Contribution

A novel framework for defining nonlinear quantile index (QI) biomarkers from single-cell protein expression data.

## Key findings

- Nonlinear QI biomarkers outperformed linear ones in predicting cancer progression and mitotic index.
- The method captures heterogeneity in protein expression without requiring spatial information.
- The R packages Qindex and hyper.gam were developed to implement the proposed framework.

## Abstract

The intra-tumor heterogeneity of protein expression is well recognized and may provide important information for cancer prognosis and predicting treatment responses. Analytic methods that account for spatial heterogeneity remain methodologically complex and computationally demanding for single-cell protein expression. For many functional proteins, single-cell expressions vary independently of spatial localization in a substantial proportion of the tumor tissues, and incorporation of spatial information may not affect the prognostic value of such protein biomarkers.

We developed a new framework for using the distributions of functional single-cell protein expression levels as cancer biomarkers. The quantile functions of single-cell expressions are used to fully capture the heterogeneity of protein expression across all cancer cells. The quantile index (QI) biomarker is defined as an integral of an unspecified function which may depend linearly or nonlinearly on a tissue-specific quantile function. Linear and nonlinear versions of QI biomarkers based on single-cell expressions of ER, Ki67, TS, and CyclinD3 were derived and evaluated as predictors of progression-free survival or high mitotic index in a large breast cancer dataset. We evaluated performance and demonstrated the advantages of nonlinear QI biomarkers through simulation studies.

The associated R package Qindex is available at https://CRAN.R-project.org/package=Qindex and R package hyper.gam is available at https://github.com/tingtingzhan/hyper.gam. Examples of R code and detailed instructions could be found in vignette quantile-index-predictor (https://CRAN.R-project.org/package=hyper.gam/vignettes/applications.html#quantile-index-predictor).

## Linked entities

- **Proteins:** EREG (epiregulin), Mki67 (antigen identified by monoclonal antibody Ki 67), CACNA1C (calcium voltage-gated channel subunit alpha1 C)
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** CCND3 (cyclin D3) [NCBI Gene 896], EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}
- **Diseases:** cancer (MESH:D009369), breast cancer (MESH:D001943)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12070390/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12070390/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12070390/full.md

---
Source: https://tomesphere.com/paper/PMC12070390