# BayCount: A Bayesian Decomposition Method for Inferring Tumor   Heterogeneity using RNA-Seq Counts

**Authors:** Fangzheng Xie, Mingyuan Zhou, Yanxun Xu

arXiv: 1702.07981 · 2017-02-28

## TL;DR

BayCount is a Bayesian method that accurately infers tumor heterogeneity from RNA-Seq count data, estimating subclonal proportions and gene expression profiles without normalization.

## Contribution

It introduces a novel Bayesian decomposition approach using negative binomial factor analysis for tumor heterogeneity inference from raw RNA-Seq counts.

## Key findings

- Accurately estimates number of subclones
- Determines subclone proportions in tumors
- Identifies gene expression profiles in subclones

## Abstract

Tumor is heterogeneous - a tumor sample usually consists of a set of subclones with distinct transcriptional profiles and potentially different degrees of aggressiveness and responses to drugs. Understanding tumor heterogeneity is therefore critical to precise cancer prognosis and treatment. In this paper, we introduce BayCount, a Bayesian decomposition method to infer tumor heterogeneity with highly over-dispersed RNA sequencing count data. Using negative binomial factor analysis, BayCount takes into account both the between-sample and gene-specific random effects on raw counts of sequencing reads mapped to each gene. For posterior inference, we develop an efficient compound Poisson based blocked Gibbs sampler. Through extensive simulation studies and analysis of The Cancer Genome Atlas lung cancer and kidney cancer RNA sequencing count data, we show that BayCount is able to accurately estimate the number of subclones, the proportions of these subclones in each tumor sample, and the gene expression profiles in each subclone. Our method represents the first effort in characterizing tumor heterogeneity using RNA sequencing count data that simultaneously removes the need of normalizing the counts, achieves statistical robustness, and obtains biologically and clinically meaningful insights.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.07981/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1702.07981/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1702.07981/full.md

---
Source: https://tomesphere.com/paper/1702.07981