# The impact of ambient contamination on demultiplexing methods for single-nucleus multiome experiments

**Authors:** Terence Li, Marcus Alvarez, Cuining Liu, Kevin Abuhanna, Yu Sun, Jason Ernst, Kathrin Plath, Brunilda Balliu, Chongyuan Luo, Noah Zaitlen

PMC · DOI: 10.21203/rs.3.rs-5977005/v1 · Research Square · 2025-02-10

## TL;DR

This paper studies how ambient contamination affects the accuracy of demultiplexing methods in single-nucleus multiome experiments and introduces a new tool and metric to improve this.

## Contribution

The paper introduces ambisim, a genotype-aware simulator, and a new metric called variant consistency to better model ambient contamination in demultiplexing.

## Key findings

- Demultiplexing methods are variably impacted by ambient contamination in both RNA and ATAC modalities.
- Concordance between demultiplexing methods is highly variable in joint snRNA/snATAC datasets.
- The new variant consistency metric correlates with ambient molecule fractions in singlets.

## Abstract

Sample multiplexing has become an increasingly common design choice in droplet-based single-nucleus multi-omic sequencing experiments to reduce costs and remove technical variation. Genotype-based demultiplexing is one popular class of methods that was originally developed for single-cell RNA-seq, but has not been rigorously benchmarked in other assays, such as snATAC-seq and joint snRNA/snATAC assays, especially in the context of variable ambient RNA/DNA contamination. To address this, we develop ambisim, a genotype-aware read-level simulator that can flexibly control ambient molecule proportions and generate realistic joint snRNA/snATAC data. We use ambisim to evaluate demultiplexing methods across several important parameters: doublet rate, number of multiplexed donors, and coverage levels. Our simulations reveal that methods are variably impacted by ambient contamination in both modalities. We then applied the demultiplexing methods to two joint snRNA/snATAC datasets and found highly variable concordance between methods in both modalities. Finally, we develop a new metric, variant consistency, which we show is correlated with cell-level ambient molecule fractions in singlets. Applying our metric to two multiplexed joint snRNA/snATAC datasets reveals variable ambient contamination across experiments and modalities. We conclude that improved modelling of ambient material in demultiplexing algorithms will increase both sensitivity and specificity.

## Full-text entities

- **Genes:** XCL1 (X-C motif chemokine ligand 1) [NCBI Gene 6375] {aka ATAC, LPTN, LTN, SCM-1, SCM-1a, SCM1}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}
- **Chemicals:** Demuxalot (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11844637/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11844637/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC11844637/full.md

---
Source: https://tomesphere.com/paper/PMC11844637