# A hierarchical Bayesian framework for inferring mitochondrial clonal selection from single-cell data

**Authors:** Aoqi Wang, Yanfei Wang, Xiaona Liu, Qing Wang, Sen Guo, Jianguo Wen, Xiaobo Zhou, Qianqian Song

PMC · DOI: 10.21203/rs.3.rs-8490828/v1 · Research Square · 2026-02-12

## TL;DR

This paper introduces MitoBayes, a statistical tool that identifies how mitochondrial DNA mutations contribute to disease progression by analyzing single-cell data.

## Contribution

MitoBayes is a novel hierarchical Bayesian framework that jointly models mitochondrial clonal lineage and selection pressures in single-cell data.

## Key findings

- MitoBayes accurately recovers selection coefficients across various genetic heterogeneity and data sparsity scenarios.
- The tool reveals disease-specific mitochondrial clonal patterns in Alzheimer’s disease, lung cancer, and hepatocellular carcinoma.
- Elevated MT-ATP6 activity is linked to chemotherapy resistance and poor outcomes in small-cell lung cancer.

## Abstract

Mitochondrial genetic heterogeneity arises from the accumulation of somatic mitochondrial DNA (mtDNA) mutations within individual cells, generating intracellular clonal populations whose selective dynamics in disease remain poorly characterized. Here, we present MitoBayes, a hierarchical Bayesian framework that jointly models mitochondrial clonal lineage structure, allele frequency variation, and single-cell disease-relevant phenotypic burdens to infer clone-specific selection pressures. Extensive benchmarking demonstrates that MitoBayes accurately recovers ground-truth selection coefficients across a wide range of genetic heterogeneity, data sparsity, and lineage complexity scenarios. Application of MitoBayes to single-cell atlases of Alzheimer’s disease (AD) cortex, treatment-naïve non–small-cell lung cancer (NSCLC), and chemotherapy-resistant small-cell lung cancer (SCLC) revealed distinct, disease-specific patterns of mitochondrial clonal selection. These include selective expansion of high-risk mitochondrial clones associated with disruption of PVALB interneuron homeostasis in AD; disease-driven clonal remodeling in cycling T/NK cells from NSCLC tumors characterized by increased mitochondrial biogenesis and impaired immune regulatory programs; and preferential enrichment of a tumor-associated MT-ATP6 (m.8859A>G) clone linked to metabolic adaptation and platinum resistance in SCLC. Pan-cancer survival analyses further confirmed the clinical relevance of elevated MT-ATP6 activity, which was associated with inferior chemotherapy outcomes. Additionally, in hepatocellular carcinoma (HCC), a dominant m.2356C>G clone correlated with POLR2A activation and widespread transcriptional amplification, consistent with a mitochondria–nucleus signaling axis contributing to adverse prognosis in this cancer type. Collectively, these findings establish MitoBayes as a robust statistical framework linking mitochondrial genetic diversity to disease phenotypes and highlight mitochondrial clonal selection as a mechanistically and clinically actionable target for therapeutic and diagnostic development.

## Linked entities

- **Genes:** PVALB (parvalbumin) [NCBI Gene 5816], POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430], ATP6 (ATP synthase F0 subunit 6) [NCBI Gene 4508]
- **Diseases:** Alzheimer’s disease (MONDO:0004975), non–small-cell lung cancer (MONDO:0005233), small-cell lung cancer (MONDO:0008433), hepatocellular carcinoma (MONDO:0007256)

## Full-text entities

- **Genes:** FBXL5 (F-box and leucine rich repeat protein 5) [NCBI Gene 26234] {aka CC2D2A-AS1, FBL4, FBL5, FLR1}, SIDT2 (SID1 transmembrane family member 2) [NCBI Gene 51092] {aka CGI-40}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, CBLL2 (Cbl proto-oncogene like 2) [NCBI Gene 158506] {aka CT138, HAKAIL, ZNF645}, POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430] {aka NEDHIB, POLR2, POLRA, RPB1, RPBh1, RPO2}, ATP6 (ATP synthase F0 subunit 6) [NCBI Gene 4508] {aka ATPase6, MTATP6}, KLHL24 (kelch like family member 24) [NCBI Gene 54800] {aka CMH29, DRE1, EBS6, EBSSH, KRIP6}, MTRNR2L1 (MT-RNR2 like 1 (pseudogene)) [NCBI Gene 100462977] {aka HN1, RNA143598}, PVALB (parvalbumin) [NCBI Gene 5816] {aka D22S749}, EZH2 (enhancer of zeste 2 polycomb repressive complex 2 subunit) [NCBI Gene 2146] {aka ENX-1, ENX1, EZH2b, KMT6, KMT6A, WVS}, ARID1A (AT-rich interaction domain 1A) [NCBI Gene 8289] {aka B120, BAF250, BAF250a, BM029, C1orf4, CSS2}, CDK1 (cyclin dependent kinase 1) [NCBI Gene 983] {aka CDC2, CDC28A, P34CDC2}, NRF1 (nuclear respiratory factor 1) [NCBI Gene 4899] {aka ALPHA-PAL}, MTRNR2L8 (MT-RNR2 like 8 (pseudogene)) [NCBI Gene 100463486] {aka HN8}
- **Diseases:** MCMC (MESH:D007161), cytotoxic (MESH:D064420), immune (MESH:D007154), cognitive decline (MESH:D003072), liver tumor (MESH:D008113), HCC (MESH:D006528), SCLC (MESH:D055752), mitochondrial dysfunction (MESH:D028361), neurodegeneration (MESH:D019636), metabolic syndrome (MESH:D024821), AD (MESH:D000544), lung cancer (MESH:D008175), lung disease (MESH:D008171), Cancer (MESH:D009369), NSCLC (MESH:D002289), cellular disease (MESH:D004806)
- **Chemicals:** MitoBayes (-), ATP (MESH:D000255), sphingolipid (MESH:D013107), platinum (MESH:D010984), iron- (MESH:D007501)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** 2356_C>G, m.8859A>G, m.8859A > G, 4314_A>G, 8859A>G, m.8860A>G, m.2356C > G, m.8701A>G, -2356_C>G, Lys->Arg at position 52
- **Cell lines:** GSE157827 — Konosirus punctatus (Dotted gizzard shad), Spontaneously immortalized cell line (CVCL_6F81)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12919214/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12919214/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12919214/full.md

---
Source: https://tomesphere.com/paper/PMC12919214