Re-analysis of the Human Transcription Factor Atlas Recovers TF-Specific Signatures from Pooled Single-Cell Screens with Missing Controls
Arka Jain, Umesh Sharma

TL;DR
This study re-analyzed the human TF Atlas pooled single-cell dataset, developing a pipeline to recover TF-specific signatures despite missing controls, revealing key TFs and pathways involved in transcriptional regulation.
Contribution
The paper introduces a reproducible re-analysis pipeline that recovers TF-specific signatures from pooled single-cell data lacking internal controls.
Findings
Recovered TF-specific signatures for 59 of 61 testable TFs
Identified key TFs like HOPX, MAZ, PAX6, FOS, FEZF2 as strong remodelers
Validated TF effect sizes against published rankings
Abstract
Public pooled single-cell perturbation atlases are valuable resources for studying transcription factor (TF) function, but downstream re-analysis can be limited by incomplete deposited metadata and missing internal controls. Here we re-analyze the human TF Atlas dataset (GSE216481), a MORF-based pooled overexpression screen spanning 3,550 TF open reading frames and 254,519 cells, with a reproducible pipeline for quality control, MORF barcode demultiplexing, per-TF differential expression, and functional enrichment. From 77,018 cells in the pooled screen, we assign 60,997 (79.2\%) to 87 TF identities. Because the deposited barcode mapping lacks the GFP and mCherry negative controls present in the original library, we use embryoid body (EB) cells as an external baseline and remove shared batch/transduction artifacts by background subtraction. This strategy recovers TF-specific signatures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
