# scMEDAL for the interpretable analysis of single-cell transcriptomics data with batch effect visualization using a deep mixed effects autoencoder

**Authors:** Aixa X. Andrade, Son Nguyen, Albert Montillo

PMC · DOI: 10.21203/rs.3.rs-6081478/v1 · 2025-03-19

## TL;DR

scMEDAL is a new method for analyzing single-cell RNA data that separates batch effects to improve accuracy and understanding of cellular differences.

## Contribution

scMEDAL introduces a novel deep autoencoder framework that separately models batch-invariant and batch-specific effects in single-cell transcriptomics.

## Key findings

- scMEDAL suppresses batch effects while modeling batch-specific variation, improving accuracy and interpretability.
- The framework enables retrospective analyses by predicting cell expression as if acquired in a different batch.
- Combining latent spaces enhances predictions of disease status, donor group, and cell type.

## Abstract

scRNA-seq data has the potential to provide new insights into cellular heterogeneity and data acquisition; however, a major challenge is unraveling confounding from technical and biological batch effects. Existing batch correction algorithms suppress and discard these effects, rather than quantifying and modeling them. Here, we present scMEDAL, a framework for single-cell
Mixed
Effects
Deep
Autoencoder
Learning, which separately models batch-invariant and batch-specific effects using two complementary autoencoder networks. One network is trained through adversarial learning to capture a batch-invariant representation, while a Bayesian autoencoder learns a batch-specific representation. Comprehensive evaluations spanning conditions (e.g., autism, leukemia, and cardiovascular), cell types, and technical and biological effects demonstrate that scMEDAL suppresses batch effects while modeling batch-specific variation, enhancing accuracy and interpretability. Unlike prior approaches, the framework’s fixed- and random-effects autoencoders enable retrospective analyses, including predicting a cell’s expression as if it had been acquired in a different batch via genomap projections at the cellular level, revealing the impact of biological (e.g., diagnosis) and technical (e.g., acquisition) effects. By combining scMEDAL’s batch-agnostic and batch-specific latent spaces, it enables more accurate predictions of disease status, donor group, and cell type, making scMEDAL a valuable framework for gaining deeper insight into data acquisition and cellular heterogeneity.

## Linked entities

- **Diseases:** autism (MONDO:0005260), leukemia (MONDO:0004355)

## Full-text entities

- **Diseases:** autism (MESH:D001321), leukemia (MESH:D007938)

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11957221/full.md

---
Source: https://tomesphere.com/paper/PMC11957221