Federated-inspired Single-cell Batch Integration in Latent Space
Quang-Huy Nguyen, Zongliang Yue, Hao Chen, Wei-Shinn Ku, Jiaqi Wang

TL;DR
scBatchProx is a federated-inspired method that refines cell embeddings in single-cell RNA sequencing data, improving batch correction and biological signal preservation without centralized data sharing.
Contribution
Introduces scBatchProx, a lightweight, post-hoc optimization technique for batch correction in latent space inspired by federated learning, applicable to arbitrary upstream methods.
Findings
Achieves 3-8% improvement in embedding quality.
Improves batch correction in 90% of cases.
Enhances biological conservation in 85% of data-method pairs.
Abstract
Advances in single-cell RNA sequencing enable the rapid generation of massive, high-dimensional datasets, yet the accumulation of data across experiments introduces batch effects that obscure true biological signals. Existing batch correction approaches either insufficiently correct batch effects or require centralized retraining on the complete dataset, limiting their applicability in distributed and continually evolving single-cell data settings. We introduce scBatchProx, a post-hoc optimization method inspired by federated learning principles for refining cell-level embeddings produced by arbitrary upstream methods. Treating each batch as a client, scBatchProx learns batch-conditioned adapters under proximal regularization, correcting batch structure directly in latent space without requiring raw expression data or centralized optimization. The method is lightweight and deployable,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Cell Image Analysis Techniques · Microfluidic and Bio-sensing Technologies
