# Branch-specific gene discovery in cell differentiation using multi-omics graph attention

**Authors:** Yihao Yin, Linzhi Zhuang, Yulei Wang, Yazhou Shi, Bengong Zhang

PMC · DOI: 10.1371/journal.pcbi.1013664 · 2025-11-03

## TL;DR

This paper introduces BranchKGN, a new method that uses multi-omics data to identify key genes involved in cell differentiation and reconstruct gene regulatory networks.

## Contribution

The novel contribution is a graph attention-based framework for identifying branch-specific key genes in cell differentiation using scRNA-seq and scATAC-seq data.

## Key findings

- BranchKGN successfully identifies key genes at bifurcation points in cell differentiation trajectories.
- The identified genes support accurate reconstruction of differentiation trajectories across multiple datasets.

## Abstract

Understanding gene regulation during cell differentiation requires effective integration of multi-omics single-cell data. In this study, we propose BranchKGN, a heterogeneous graph transformer-based framework for identifying branch-specific key genes along cell differentiation trajectories. By integrating scRNA-seq and scATAC-seq data into a unified gene representation, we infer differentiation trajectories using Slingshot and construct a heterogeneous graph capturing gene–cell relationships. Through attention-based graph learning, BranchKGN assigns gene importance scores within each cell, enabling the identification of genes consistently informative across branch point cells and their descendant lineages. These genes are then used to reconstruct gene regulatory networks and differentiation trajectories. Validation on three independent datasets demonstrates that the identified gene sets not only capture key regulators of cell fate bifurcation but also support accurate reconstruction of differentiation trajectories. Our results highlight the effectiveness of BranchKGN in dissecting gene regulation dynamics during cellular transitions and provide a valuable tool for multi-omics single-cell analysis.

Cell fate is an important biology process during biological development, tissue regeneration, and disease progression. However, the modes of cell differentiation and the patterns of gene expression changes during this process are still not so clear. To investigate these problems, we introduce BranchKGN, a framework based on graph attention mechanisms designed to identify branch-specific key genes along cell differentiation trajectories. By integrating scRNA-seq and scATAC-seq data and inferring differentiation trajectories using Slingshot, BranchKGN employs multi-head attention learning to score the importance of genes within each cell. And then branch-specific key genes at bifurcation points are identified. These genes are used to reconstruct gene regulatory networks and differentiation trajectories. We test BranchKGN on three independent datasets, can identify specific key gene sets at cell fate bifurcations. These key genes can accurately reconstruct the cell differentiation trajectories in turn. Our results highlight that BranchKGN is a powerful tool to dissect gene regulatory dynamics during cell transitions and analyze the multi-omics single-cell data.

## Full-text entities

- **Genes:** ANAPC16 (anaphase promoting complex subunit 16) [NCBI Gene 119504] {aka APC16, C10orf104, CENP-27, MSAG, bA570G20.3}, PAGR1 (PAXIP1 associated glutamate rich protein 1) [NCBI Gene 79447] {aka C16orf53, GAS, PA1}, LYST (lysosomal trafficking regulator) [NCBI Gene 1130] {aka CHS, CHS1, Mauve}, HOPX (HOP homeobox) [NCBI Gene 84525] {aka CAMEO, HOD, HOP, LAGY, NECC1, OB1}, FZD3 (frizzled class receptor 3) [NCBI Gene 7976] {aka Fz-3}, USP3 (ubiquitin specific peptidase 3) [NCBI Gene 9960] {aka SIH003, UBP}, UBE2D2 (ubiquitin conjugating enzyme E2 D2) [NCBI Gene 7322] {aka E2(17)KB2, PUBC1, UBC4, UBC4/5, UBCH4, UBCH5B}, LIFR (LIF receptor subunit alpha) [NCBI Gene 3977] {aka CD118, LIF-R, SJS2, STWS, SWS}, TNC (tenascin C) [NCBI Gene 3371] {aka 150-225, DFNA56, GMEM, GP, HXB, JI}, FAM107A (family with sequence similarity 107 member A) [NCBI Gene 11170] {aka DRR1, TU3A}, PTPRZ1 (protein tyrosine phosphatase receptor type Z1) [NCBI Gene 5803] {aka HPTPZ, HPTPzeta, PTP-ZETA, PTP18, PTPRZ, PTPZ}, USP1 (ubiquitin specific peptidase 1) [NCBI Gene 7398] {aka UBP}, CHD3 (chromodomain helicase DNA binding protein 3) [NCBI Gene 1107] {aka Mi-2a, Mi2-ALPHA, SNIBCPS, ZFH}, ARHGAP26 (Rho GTPase activating protein 26) [NCBI Gene 23092] {aka GRAF, GRAF1, OPHN1L, OPHN1L1}, SIPA1L1 (signal induced proliferation associated 1 like 1) [NCBI Gene 26037] {aka E6TP1, SPAR1}, MAML2 (mastermind like transcriptional coactivator 2) [NCBI Gene 84441] {aka MAM-3, MAM2, MAM3, MLL-MAML2}
- **Chemicals:** oRG-111 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

28 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12594343/full.md

---
Source: https://tomesphere.com/paper/PMC12594343