# PURE-seq integrates FACS and PIP-seq for single-cell genomics of ultra-rare cells

**Authors:** Sixuan Pan, Inés Fernández-Maestre, Kai-Chun Chang, Stéphane Van Haver, Matthew G. Wereski, Alexandra M. Haugh, Katy K. Tsai, Adil I. Daud, Robert L. Bowman, Harish N. Vasudevan, Ross L. Levine, Adam R. Abate

PMC · DOI: 10.1038/s41467-025-68146-w · Nature Communications · 2026-01-21

## TL;DR

PURE-seq is a new method that combines FACS and PIP-seq to efficiently sequence ultra-rare cells, enabling detailed study of rare cell populations like cancer cells and aging stem cells.

## Contribution

PURE-seq integrates FACS and PIP-seq to directly sequence ultra-rare cells at a rarity of 1 in 1,000,000.

## Key findings

- PURE-seq successfully isolated and sequenced circulating tumor cells from metastatic melanoma patient blood.
- Transcriptomic analysis identified Egr1 as a putative master regulator of murine hematopoietic stem and progenitor cell aging.

## Abstract

Single-cell transcriptomics is valuable for uncovering individual cell properties, particularly in heterogeneous systems. However, this technique often results in the reanalysis of many well-characterized cells, increasing costs and diluting rare cell populations. To address this, we develop PIP-seq for Rare-cell Enrichment and Sequencing (PURE-seq). PURE-seq allows direct FACS sorting of cells into PIP-seq reactions, minimizing handling and reducing cell loss. PURE-seq reliably sequences ultrarare cells, with 1 hour of sorting capturing tens of target cells at a rarity of 1 in 1,000,000. Leveraging this extreme sensitivity, we use PURE-seq to isolate and single-cell sequence circulating tumor cells from metastatic melanoma patient blood, obtaining detailed single cancer cell gene expression profiles. Additionally, we use PURE-seq to examine hematopoietic stem and progenitor cells from young, old and middle-aged mice. Transcriptomic analysis identifies Egr1 as a putative master regulator of murine hematopoietic stem and progenitor cell aging, demonstrating PURE-seq’s utility as a discovery platform for basic science applications. PURE-seq offers a simple and highly sensitive method for single-cell sequencing ultra-rare cells.

Rare cells are often biologically and clinically important, but their low abundance makes them challenging to study using single-cell transcriptomics. Here, the authors develop PURE-seq which integrates FACS and PIP-seq to directly sequence ultra-rare cells. It captures cells at 1 in 1,000,000 rarity, which the authors demonstrate by profiling circulating tumor cells and identifying Egr1 as a regulator of mouse hematopoietic stem cell aging.

## Linked entities

- **Genes:** EGR1 (early growth response 1) [NCBI Gene 1958]
- **Diseases:** metastatic melanoma (MONDO:0005191)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Pramex1 (PRAME like, X-linked 1) [NCBI Gene 75829] {aka 4930534P07Rik, Prame}, SPAG17 (sperm associated antigen 17) [NCBI Gene 200162] {aka CT143, PF6, SPGF55}, Dapk1 (death associated protein kinase 1) [NCBI Gene 69635] {aka D13Ucla1, DAP-Kinase}, Epcam (epithelial cell adhesion molecule) [NCBI Gene 17075] {aka CD326, EGP, EGP-2, Egp314, Ep-CAM, EpCAM1}, Notch2 (notch 2) [NCBI Gene 18129] {aka N2}, Cd48 (CD48 antigen) [NCBI Gene 12506] {aka BCM1, BLAST, BLAST-1, BLAST1, Bcm-1, MEM-102}, PTPRC (protein tyrosine phosphatase receptor type C) [NCBI Gene 5788] {aka B220, CD45, CD45R, GP180, IMD105, L-CA}, PPBP (pro-platelet basic protein) [NCBI Gene 5473] {aka B-TG1, Beta-TG, CTAP-III, CTAP3, CTAPIII, CXCL7}, COL19A1 (collagen type XIX alpha 1 chain) [NCBI Gene 1310] {aka COL9A1L, D6S228E}, PECAM1 (platelet and endothelial cell adhesion molecule 1) [NCBI Gene 5175] {aka CD31, CD31/EndoCAM, GPIIA', PECA1, PECAM-1, endoCAM}, Stat3 (signal transducer and activator of transcription 3) [NCBI Gene 20848] {aka 1110034C02Rik, Aprf}, Procr (protein C receptor, endothelial) [NCBI Gene 19124] {aka Ccca, Ccd41, Epcr}, SDK1 (sidekick cell adhesion molecule 1) [NCBI Gene 221935], VWF (von Willebrand factor) [NCBI Gene 7450] {aka F8VWF, VWD}, Il6 (interleukin 6) [NCBI Gene 16193] {aka Il-6}, Meis1 (Meis homeobox 1) [NCBI Gene 17268] {aka C530044H18Rik, Evi8}, Egr3 (early growth response 3) [NCBI Gene 13655] {aka EGR-3, Pilot}, Npas2 (neuronal PAS domain protein 2) [NCBI Gene 18143] {aka MOP4, bHLHe9}, SLIT3 (slit guidance ligand 3) [NCBI Gene 6586] {aka MEGF5, SLIL2, SLIT1, Slit-3, slit2}, Ido1 (indoleamine 2,3-dioxygenase 1) [NCBI Gene 15930] {aka Ido, Indo}, Egr1 (early growth response 1) [NCBI Gene 13653] {aka A530045N19Rik, ETR103, Egr-1, Krox-1, Krox-24, Krox24}, Ly6a (lymphocyte antigen 6 family member A) [NCBI Gene 110454] {aka Ly-6A.2, Ly-6A/E, Ly-6E.1, Sca-1, Sca1, TAP}, Orc1 (origin recognition complex, subunit 1) [NCBI Gene 18392] {aka MmORC1, Orc1l}, Junb (jun B proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 16477], CSMD2 (CUB and Sushi multiple domains 2) [NCBI Gene 114784] {aka dJ1007G16.1, dJ1007G16.2, dJ947L8.1}, EPCAM (epithelial cell adhesion molecule) [NCBI Gene 4072] {aka Ber-Ep4, BerEp4, DIAR5, EGP-2, EGP314, EGP40}, Svil (supervillin) [NCBI Gene 225115] {aka B430302E16Rik}, Itga2b (integrin alpha 2b) [NCBI Gene 16399] {aka CD41, CD41B, GpIIb, alphaIIb}, FLT1 (fms related receptor tyrosine kinase 1) [NCBI Gene 2321] {aka FLT, FLT-1, VEGFR-1, VEGFR1}, Slamf1 (signaling lymphocytic activation molecule family member 1) [NCBI Gene 27218] {aka 4933415F16, CD150, CDw150, ESTM51, IPO-3, Slam}, Mpl (Mpl proto-oncogene, thrombopoietin receptor) [NCBI Gene 17480] {aka CD110, TPO-R, c-mpl, hlb219}, KALRN (kalirin RhoGEF kinase) [NCBI Gene 8997] {aka ARHGEF24, CHDS5, DUET, DUO, HAPIP, KALNC2}, Klf4 (Kruppel-like transcription factor 4 (gut)) [NCBI Gene 16600] {aka EZF, Gklf, Zie}, Cd79a (CD79A antigen (immunoglobulin-associated alpha)) [NCBI Gene 12518] {aka Ig-alpha, Iga, Igalpha, Ly-54, Ly54, mb-1}, l9Rl1 (lethal, Chr 9, Russell 1) [NCBI Gene 117202] {aka pl-1}, KIAA1217 (KIAA1217) [NCBI Gene 56243] {aka ETL4, SKT}, GNLY (granulysin) [NCBI Gene 10578] {aka D2S69E, LAG-2, LAG2, NKG5, TLA519}, Mecom (MDS1 and EVI1 complex locus) [NCBI Gene 14013] {aka D630039M04Rik, Evi-1, Evi1, Jbo, Mds, Mds1}, FAP (fibroblast activation protein alpha) [NCBI Gene 2191] {aka DPPIV, FAPA, FAPalpha, SIMP}, Mitf (melanogenesis associated transcription factor) [NCBI Gene 17342] {aka BCC2, Bhlhe32, Gsfbcc2, Vitiligo, Wh, bw}, GRIN2B (glutamate ionotropic receptor NMDA type subunit 2B) [NCBI Gene 2904] {aka DEE27, EIEE27, GluN2B, MRD6, NMDAR2B, NR2B}, Kras (Kras proto-oncogene, GTPase) [NCBI Gene 16653] {aka K-Ras, K-Ras 2, K-ras, Ki-ras, Kras-2, Kras2}, IL7R (interleukin 7 receptor) [NCBI Gene 3575] {aka CD127, CDW127, IL-7R-alpha, IL-7Ralpha, IL7RA, IL7Ralpha}, Mcam (melanoma cell adhesion molecule) [NCBI Gene 84004] {aka 1-gicerin, CD146, CD149, Muc18, s-endo, s-gicerin}, Kit (Kit proto-oncogene receptor tyrosine kinase) [NCBI Gene 16590] {aka Bs, CD117, Fdc, Gsfsco1, Gsfsco5, Gsfsow3}, Hoxb5 (homeobox B5) [NCBI Gene 15413] {aka Hox-2.1}, Nrp1 (neuropilin 1) [NCBI Gene 18186] {aka C530029I03, NP-1, NPN-1, Npn1, Nrp}, Tnf (tumor necrosis factor) [NCBI Gene 21926] {aka DIF, TNF-a, TNF-alpha, TNFSF2, TNFalpha, Tnfa}, Ifng (interferon gamma) [NCBI Gene 15978] {aka IFN-g, If2f, Ifg}, Cspg4 (chondroitin sulfate proteoglycan 4) [NCBI Gene 121021] {aka 4732461B14Rik, AN2, Cspg4a, NG2}, COL1A1 (collagen type I alpha 1 chain) [NCBI Gene 1277] {aka CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3}, Slc22a27 (solute carrier family 22, member 27) [NCBI Gene 171405] {aka OAT6-L, OAT6-S, Oat9}, S100a6 (S100 calcium binding protein A6 (calcyclin)) [NCBI Gene 20200] {aka 2A9, 5B10, CALCYCLIN, Cacy, PRA}, S100A4 (S100 calcium binding protein A4) [NCBI Gene 6275] {aka 18A2, 42A, CAPL, FSP1, MTS1, P9KA}, Ier2 (immediate early response 2) [NCBI Gene 15936] {aka Ch1, Pip92}, Hoxb4 (homeobox B4) [NCBI Gene 15412] {aka Hox-2.6}, Cnot6 (CCR4-NOT transcription complex, subunit 6) [NCBI Gene 104625] {aka CCR4}, Phkg1 (phosphorylase kinase gamma 1) [NCBI Gene 18682] {aka Phkg}, IL36B (interleukin 36 beta) [NCBI Gene 27177] {aka FIL1, FIL1-(ETA), FIL1H, FILI-(ETA), IL-1F8, IL-1H2}, Cd34 (CD34 antigen) [NCBI Gene 12490]
- **Diseases:** UMAP (MESH:C567162), Cancer (MESH:D009369), melanoma (MESH:D008545), Inflammatory (MESH:D007249), Breast Cancer (MESH:D001943), Hypoxia (MESH:D000860), cardiovascular or degenerative disorders (MESH:D002318), myeloid leukemia (MESH:D007951), metastases (MESH:D009362), HSC (MESH:D019337)
- **Chemicals:** 1X (-), Calcein (MESH:C007740), DAPI (MESH:C007293), Calcein AM (MESH:C085925), magnesium (MESH:D008274), lipid (MESH:D008055), Calcein Green (MESH:C083098), calcium (MESH:D002118), Phenol red (MESH:D010637), oil (MESH:D009821), water (MESH:D014867), Trypan Blue (MESH:D014343), PBS (MESH:D007854), CO2 (MESH:D002245), Cholesterol (MESH:D002784), 3,3',5,5'-tetramethylbenzidine (MESH:C021758), Cy5.5 (MESH:C098793), DPBS (MESH:C012939), EDTA (MESH:D004492)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090], Rattus norvegicus (brown rat, species) [taxon 10116]
- **Cell lines:** NIH 3T3 — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_0594), HEK 293 T — Homo sapiens (Human), Transformed cell line (CVCL_0063), C57BL/6 — Mus musculus (Mouse), Transformed cell line (CVCL_C0MU), ifn — Mus musculus (Mouse), Transformed cell line (CVCL_4245)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12881479/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12881479/full.md

---
Source: https://tomesphere.com/paper/PMC12881479