TBCancer: A database for exploring characteristics and functions of tissue‐biased genes in cancer
Zhuobin Lin, Kunhua Hu, Hongyan Sun, Xiaoqiong Bao, Lin Tang, Wei Liu, Zhixiang Zuo, Zhihang Chen

TL;DR
This paper introduces a database of tissue-biased genes in cancer, showing how their inactivation contributes to cancer progression and could aid in developing new treatments.
Contribution
The novel contribution is the identification and integration of multi-omics data for 10,921 tissue-biased genes across 54 tissues and 41 cancers.
Findings
Tumor-associated tissue-biased genes are often downregulated, mutated, or epigenetically modified.
Inactivation of these genes promotes cancer by increasing stemness and immune evasion.
The database provides a resource for exploring these genes' roles in cancer mechanisms and therapies.
Abstract
This study defined 10,921 tissue‐biased genes across 54 normal tissues and 41 cancer types. Tumor‐associated tissue‐biased genes exhibit downregulation, mutations, and epigenetic modifications, correlating with poor clinical outcomes. Their inactivation promotes tumorigenesis by enhancing stemness and immune evasion, highlighting their value as prognostic biomarkers and therapeutic targets. To facilitate research, we developed a database integrating multi‐omics data on these genes for mechanistic and therapeutic exploration.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —Shenzhen Medical Research Fund
- —Guangdong Esophageal Cancer Institute Science and Technology Program
- —Guangzhou Municipal Institute Enterprise Joint Funding Project
- —National Natural Science Foundation of China10.13039/501100001809
- —Guangzhou Basic and Applied Basic Research Foundation
- —Operation Cost of Guangdong Provincial Key Laboratory of Guangdong Provincial Science and Technology Plan Project
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCRISPR and Genetic Engineering · RNA modifications and cancer · Epigenetics and DNA Methylation
To the Editor,
Tumors are classified by tissue of origin. There are profoundly different therapy responses in different tissues. Abnormalities of tissue‐biased genes and regulons are common features among cancer types [1]. These findings led to growing attention on tissue‐biased genes and promoted the development of related databases such TissGDB [2] and GENT [3]. However, existing databases presented limited information, including the clinical relevance analysis and the changes in gene expression. It was caused by limited quantity and types of datasets and insufficient function exploration of tissue‐biased genes in previous stages. Recent advances in genetic screening and multi‐omics technologies now facilitate more comprehensive characterization of these genes [4]. Genome‐wide CRISPR screenings have identified renal lineage factor PAX8 in kidney renal cell carcinoma cell lines [5]. CRISPR can also identify immune regulators through in vivo models. Therapy sensitivity data also facilitate the discovery of tumor‐associated tissue‐biased genes, such as the C‐X‐C motif chemokine ligands (CXCLs)/C‐X‐C motif chemokine receptors (CXCRs) axis mediating chemoresistance in several cancer types [6]. Systematic multi‐omics analyses on finding common genomic features and mechanisms of tissue‐biased genes are important but lacking. Moreover, there is few single‐cell research focusing on tissue‐biased genes, which helps uncover their function on tumor evolution and immune environment. Therefore, it is urgent to build a new database to meet the research needs.
To bridge this gap, we built up the genomic and transcriptomic profiles of tissue‐biased genes across 54 normal tissues and 41 cancer types at the bulk level, and revealed their function in cancer transitions based on single‐cell RNA sequencing data. Tissue‐biased genes showed lower expression, more mutations, and higher methylation level. Single‐cell analysis found the loss of tissue‐biased genes in cancer stem cells, related to immune escape mediated by regulatory T cells (Treg). These features are common in cancer types. We further screened out four liver tissue‐biased genes highly associated with stemness and validated their function by in vitro experiments, which had potential value as therapeutic targets. Based on our findings, we developed a pan‐cancer tissue‐biased and cancer‐specific genes atlas (TBCancer). TBCancer integrates transcriptomics, proteomics, cell line therapy, CRISPR‐screening and immune‐screening data, and provides the correlation of tissue‐biased genes with cancer cell stemness and immune infiltration. Our study provided a comprehensive landscape of tissue‐biased genes in cancer and revealed that tissue‐biased genes could be essential for restricting cancer cell transition by regulating stemness and immune crosstalk.
RESULTS AND DISCUSSION
1
Pan‐cancer multi‐omics analysis reveals tissue‐biased gene patterns
1.1
In this study, we defined tissue‐biased genes as those exhibiting at least a two‐fold higher expression level in one tissue compared to all other tissues, with statistically significant differences. In diverse cancer types, tissue‐biased genes were significantly downregulated in corresponding tumors (Figure 1A, Figure S1A), which are enriched in cancer‐driving pathways including Kirsten rat sarcoma viral oncogene homolog (KRAS) and cell cycle pathways (Figure 1B). There were frequent genomic and epigenomic alterations on tissue‐biased genes, correlating with abnormal expression patterns in 5%–50% of these genes (Figure S1B). High expression of tissue‐biased genes was associated with improved overall survival (OS) and progression‐free survival (PFS) in liver cancer, kidney cancer, and lung cancer (Figure 1C, Figure S1C−E). Moreover, the patients in the late clinical and pathological stage showed decreased expression of tissue‐biased genes (Figure S1F). These results highlighted the clinical importance of tissue‐biased genes in tumors.
*The characteristics and functions of tissue‐biased genes in cancer. (A) Heatmap of log2‐transformed fold changes (log2FC) of tissue‐biased gene expression level between normal tissue adjacent to the tumor (NAT) and tumor tissues. The abbreviations of cancer types in TCGA included adrenocortical cancer (ACC), bladder urothelial carcinoma (BLCA), brain lower grade glioma (LGG), glioblastoma multiforme (GBM), breast invasive carcinoma (BRCA), cervical and endocervical cancer (CESC), colon adenocarcinoma (COAD), esophageal carcinoma (ESCA), kidney chromophobe (KICH), kidney clear cell carcinoma (KIRC), kidney papillary cell carcinoma (KIRP), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), ovarian serous cystadenocarcinoma (OV), pancreatic adenocarcinoma (PAAD), prostate adenocarcinoma (PRAD), skin cutaneous melanoma (SKCM), stomach adenocarcinoma (STAD), testicular germ cell tumor (TGCT), thyroid carcinoma (THCA), and uterine carcinosarcoma (UCS). (B) Enrichment of differentially expressed tissue‐biased in HALLMARK pathways (p < 0.05, hypergeometric test). (C) Stratification of TCGA LIHC cohorts with tissue‐biased gene score (median cut‐off) for overall survival (p < 0.05, log‐rank test). (D) Uniform Manifold Approximation and Projection (UMAP) plot of cancer cell types annotated by unique colors in liver cancer. (E) UMAP plot showing learned trajectories by monocle in cancer cell types colored by pseudotime ordering. (F) The tissue‐biased (TB) gene score and stemness score of cancer cell types at different evolutionary stages. (G) Venn diagram showing shared genes between genes significantly downregulated in LIHC tissues and genes significantly downregulated in LCSC cluster. (H) The immune infiltration fraction of liver cancer patients with high or low expression levels of liver tissue‐biased genes calculated by CIBERSORT (p < 0.05, two‐tailed independent t‐test). (I) The significant ligand‐receptor interactions between liver cancer cells and immune cells modeled by CellChat (p < 0.05). (J) Volcano plot showing tissue‐biased genes related to cancer cell stemness at both the bulk level and the single cell level. The x‐axis represented the strength of correlation between the expression level of tissue‐biased genes and stemness genes, and the vertical dashed lines indicated the thresholds for statistically significant correlation. The y‐axis represented the expression changes of tissue‐biased genes, and the horizontal dash lines indicated the thresholds for statistically significant changes. The y‐axis data were standardized to a 0–1 scale for better visual representation. Top five candidate genes and the corresponding dots were marked in different colors, and the other dots were marked in gray. (K) The expression level of PCK1 between NAT and LIHC samples (left), and among LIHC samples with different PCK1 methylation levels (right) (Benjamini‐Hochberg adjusted p‐value < 0.05, two‐tailed independent t‐test). (L) Cell proliferation of wild‐type Huh‐7 cells (Control, Ctrl) and Huh‐7 cells separately transfected with overexpression plasmids (OE) of HSD17B13, CYP2C8, ADH4, and PCK1 measured by CCK‐8 assay. Cell migration of control and OE Huh‐7 cells measured by transwell assay (M) and scratch assay (N). (O) Western blot analysis for stemness‐related proteins in control and OE‐PCK1 Huh‐7 cells. For the in vitro assay data (L−O), Mann–Whitney U tests (p < 0.05) were performed for pairwise comparisons between Control (n = 3) and OE groups (n = 3) at each time point. (P) Schematic diagram illustrating the interplay between progressive loss of tissue‐biased genes, elevated cancer stemness, and immune evasion in tumor evolution. The abbreviations in the diagram included epithelial‐mesenchymal transition (EMT) and tumor microenvironment (TME). Statistical significance thresholds were set at ns not significant, *p < 0.05, **p < 0.01, ***p < 0.001, and ***p < 0.0001.
Single‐cell analysis reveals the changes and functions of tissue‐biased genes during tumor evolution
1.2
Next, we conducted single‐cell analysis to reveal the alteration and function of tissue‐biased genes during tumor evolution. To provide a clear demonstration, we chose liver hepatocellular carcinoma (LIHC) as a representative example, while our single‐cell analyses of diverse cancer types consistently supported these observations (Figures S2 and S3). Pseudotime analysis revealed two distinct evolutionary trajectories of tumor cells, both originating from normal hepatocytes. One trajectory differentiated further into the cell cluster highly expressing NAD(P)H:quinone oxidoreductase 1 (NQO1) and liver cancer stem cell (LCSC) clusters, while the other developed into cell clusters expressing fatty acid binding protein 1 (FABP1), apolipoprotein A2 (APOA2), heat shock protein family A member 1 A (HSPA1A), and stathmin 1 (STMN1) (Figures 1D,E, and S2A–C). We calculated a tissue‐biased gene score for each cell to compare the expression level of tissue‐biased genes among tumor cell clusters (see Materials and Methods). The results showed gradual inactivation of tissue‐biased genes during tumor cell evolution, especially in the LCSC cluster located at the end of the tumor evolution trajectory (Figure 1F, Figure S2D,E). Moreover, tissue‐biased gene scores were negatively correlated with tumor stemness scores across tumor cell clusters (Figure S2F) [7]. There was a considerable consistency between downregulated genes identified through bulk analysis and those detected at single‐cell resolution by comparing LCSC and other cell clusters (Figure 1G). Notably, several genes identified in our analysis, including HP, TTR, APOH, HPX, and SAA1, have been previously implicated in the regulation of tumor stemness [8] (Table S1). Acquisition of stemness is associated with the oncogenic dedifferentiation. Tissue‐biased genes showed activation during embryo development but silencing in cancer progression, implicating their functional loss facilitated epithelial transformation and stemness acquisition (Figure S2G−H).
Additionally, our analysis revealed that tissue‐biased genes participate in immune regulation. Immune infiltration analysis showed that patients with lower tissue‐biased gene expression have decreased infiltration of B cell and macrophage, and increased infiltration of Treg (Figure 1H). Single‐cell analysis showed more receptor‐ligand pairs between LCSC and Treg compared to other cancer cell clusters, which included midkine (MDK), nucleolin (NCL), cytotoxic T‐lymphocyte associated protein 4 (CTLA4), and CXCR known to affect Treg proliferation and maturation [9] (Figure 1I). We also observed activated receptor‐ligand pairs between LCSC and other immune cells, including B cell, CD4 + T cell, M1 macrophage, and M2 macrophage (Figure S4). These results indicated that the loss of tissue‐biased genes might impair the immune surveillance on LCSC. Pioneering investigations revealed that the crosstalk between tumor stemness pathway and immune response modulated cancer progression. Our results indicated that tissue‐biased genes might participate in these mechanisms.
Herein, we supposed that a subset of epithelial cells may lose function of tissue‐biased genes, gain stem cell‐like features, and escape immune surveillance, contributing to cancer cell proliferation and metastasis. To screen for tissue‐biased genes that are highly correlated with stemness, we calculated the correlation between the expression of tissue‐biased genes and stemness genes at both bulk and single‐cell level datasets (Figure 1J, Table S2). Based on our selection criteria, four top candidate genes were identified for further analysis and experimental validation. In The Cancer Genome Atlas (TCGA) LIHC cohort, PCK1, ADH4, HSD17B13, and CYP2C8 were significantly downregulated in tumor and could be affected by DNA methylation and copy number variation (CNV) alteration (Figure 1K, Figure S5). These were common features of tissue‐biased genes associated with tumor. Experiments showed that individual overexpression of PCK1, ADH4, HSD17B13, and CYP2C8 decreased the proliferation ability and migration ability of liver cancer cells (Figure 1L−N, Figure S5). Furthermore, we observed decreased expression of stemness genes in tumor cells transfected with ADH4 or PCK1 plasmid (Figure 1O). These results further validated the roles of tissue‐biased genes in tumor evolution (Figure 1P).
TBCancer: A comprehensive database for tissue‐biased and cancer‐specific genes
1.3
Based on the above exploration, we developed a pan‐caner tissue‐biased and cancer‐specific genes atlas (TBCancer) for better understanding on tissue‐biased genes and for screening candidate biomarkers and therapeutic targets. TBCancer can be accessed at http://tsbcancer.canceromics.org/. More than 10,000 tissue‐biased genes from 54 normal tissues were included in TBCancer. Their features and functions in cancers were analyzed by integrating multi‐omics data and clinical information of the large‐scale pan‐cancer cohorts from TCGA [10] and Gene Expression Omnibus (GEO) repository [11] (Figure 2A,B).
Overall design and construction of TBCancer. (A) The construction pipeline and structure of TBCancer database. (B) The browse and detail pages including six analysis modules in TBCancer database.
The Browser page provides the data source information and an overview of tissue‐biased genes from diverse cancer types. Users can select specific tissue/cancer types and available data types to generate a gene list of interest, then click the database ID to obtain detailed gene information. Alternatively, users can directly search for genes of interest on the Search page. The detailed page provides basic gene information, gene related pathways, and six analysis modules (Figure 2A,B, Figure S6). Expression summary visualized gene expression specificity across normal and tumor tissue types. Differential gene expression analysis revealed the changes of tissue‐biased genes across cancer types, with the downregulation of genes in their matched cancers being particularly noteworthy. These tissue‐biased and cancer‐specific genes deserved further analysis. Survival analysis was performed using the Kaplan‐Meier method based on clinical information (OS and PFS) of TCGA cohort datasets. Based on our analysis, we highlighted the importance of tissue‐biased genes in immune regulation. In TBCancer, the correlation between gene and immune infiltration in cancers was presented through four methods including ssGSEA, CIBERSORT, TIMER, and MCP counter. This module helps to find out immune‐related genes and their interactions with immune cells. Major advantages offered by TBCancer are plenty of therapy sensitivity data and CRISPR screening data, which are limited in other databases. It displayed the correlation between gene and chemotherapy and the correlation between gene and immunotherapy, through comparing the gene expression level and survival time between responder and nonresponder patients. The gene expression level at cell lines treated with drug at half‐maximal response dose (IC_50_) also provided clues to the mechanisms of tissue‐biased genes. Additionally, we gathered CRISPR screening datasets from the Cancer Cell Line Encyclopedia [12], Catalogue Of Somatic Mutations In Cancer [13], and GEO [11]. This massively parallel analysis helps to find out genes affecting cancer cell proliferation and immune cell phenotype.
Integrative analysis using TBCancer helps to identify key cancer‐associated tissue‐biased genes, with functional insights provided by single‐cell immune profiling through our TIGER database resource [14]. Taking HSD17B13 as an example, TBCancer showed its specificity in liver tissue, downregulation in liver cancer, and the association between higher expression and better prognosis. Treatment with Sorafenib showed increased expression of HSD17B13, and the negative correlation between HSD17B13 expression and MP470 IC_50_ implied that this gene may serve as a biomarker for enhanced drug sensitivity. Additionally, its expression is positively correlated with neutrophil infiltration. Single‐cell analysis through TIGER showed its enrichment in tumor cells and its loss in evolutionary trajectories, and more cell–cell interactions between tumor cells and myeloid cells. The co‐expression analysis helps to find out related genes to tissue‐biased genes of interest, such as stemness genes and immune genes we have described and other important genes in cancer.
CONCLUSION
2
In this study, we characterized common features and mechanisms of tissue‐biased genes in cancer through systematic multi‐omics analyses: (1) tissue‐biased genes were significantly decreased in cancer, a phenomenon at least partially driven by CNV and DNA methylation; (2) the distribution and alteration of tissue‐biased genes were specific in epithelial cells and cancer cells; (3) the loss of tissue‐biased genes might function on cancer cell stemness and immune escape during tumorigenesis. Based on our findings and to address current database limitations, we developed the TBCancer database, providing the users with visual interface, succinct and effective modules, and easy operation.
These findings not only highlight the underappreciated roles of tissue‐biased genes in tumor biology but also establish TBCancer as an essential resource for the emerging field.
METHODS
3
Detailed procedures for experimental methods, data analysis and statistical methodology, and TBCancer database construction are available in the Supplementary Information.
AUTHOR CONTRIBUTIONS
Zhuobin Lin: Methodology; visualization; project administration; writing—original draft. Kunhua Hu: Funding acquisition; project administration; methodology; writing—original draft. Hongyan Sun: Validation; methodology; project administration. Xiaoqiong Bao: Project administration; software. Lin Tang: Project administration; software. Wei Liu: Supervision; conceptualization; funding acquisition; writing—review and editing. Zhixiang Zuo: Conceptualization; supervision; funding acquisition; writing—review and editing. Zhihang Chen: Conceptualization; supervision; writing—review and editing; validation; visualization.
CONFLICT OF INTEREST STATEMENT
The authors declare no conflicts of interest.
ETHICS STATEMENT
No animals or humans were involved in this study.
Supporting information
Figure S1. Pan‐cancer profiles of tissue‐biased genes in tumors. Figure S2. Single‐cell analysis revealed dramatic loss of tissue‐biased genes during tumor evolution in LIHC. Figure S3. Single‐cell analysis on tissue‐biased genes in NSCLC, KIRP, and CRC. Figure S4. Abnormal expression of tissue‐biased genes affects immune crosstalk in tumor. Figure S5. The inactivation of tissue‐biased genes in liver cancer enhances tumor stemness and affects tumor proliferation and metastatic ability. Figure S6. The function and usage of TBCancer.
Table S1. Intersection of genes downregulated in LCSC relative to hepatocyte and tissue‐biased genes downregulated in liver tumor relative to normal tissue adjacent to the tumor. Table S2. List of liver‐biased genes highly associated with cancer cell stemness. Table S3. The reference gene set for tumor cell stemness scores. Table S4. List of the primers used in this study. Table S5. List of the data resources that are integrated by TBCancer.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Uhlen, Mathias , Cheng Zhang , Sunjae Lee , Evelina Sjöstedt , Linn Fagerberg , Gholamreza Bidkhori , Rui Benfeitas , et al. 2017. “A Pathology Atlas of the Human Cancer Transcriptome.” Science 357: eaan 2507. 10.1126/science.aan 2507 28818916 · doi ↗ · pubmed ↗
- 2Kim, Pora , Aekyung Park , Guangchun Han , Hua Sun , Peilin Jia , and Zhongming Zhao . 2018. “Tiss GDB: Tissue‐Specific Gene Database in Cancer.” Nucleic Acids Research 46: D 1031–D 1038. 10.1093/nar/gkx 850 29036590 PMC 5753286 · doi ↗ · pubmed ↗
- 3Park, Seung‐Jin , Byoung‐Ha Yoon , Seon‐Kyu Kim , and Seon‐Young Kim . 2019. “GENT 2: An Updated Gene Expression Database for Normal and Tumor Tissues.” BMC Medical Genomics 12: 101. 10.1186/s 12920-019-0514-7 31296229 PMC 6624177 · doi ↗ · pubmed ↗
- 4Hsu, Patrick D. , Eric S. Lander , and Feng Zhang . 2014. “Development and Applications of CRISPR‐Cas 9 for Genome Engineering.” Cell 157: 1262–1278. 10.1016/j.cell.2014.05.010 24906146 PMC 4343198 · doi ↗ · pubmed ↗
- 5Patel, Saroor A. , Shoko Hirosue , Paulo Rodrigues , Erika Vojtasova , Emma K. Richardson , Jianfeng Ge , Saiful E. Syafruddin , et al. 2022. “The Renal Lineage Factor PAX 8 Controls Oncogenic Signalling in Kidney Cancer.” Nature 606: 999–1006. 10.1038/s 41586-022-04809-8 35676472 PMC 9242860 · doi ↗ · pubmed ↗
- 6Tang, Lei , Xin Chen , Jianquan Hou , and Xuedong Wei . 2025. “CXCL 14 in Prostate Cancer: Complex Interactions in the Tumor Microenvironment and Future Prospects.” Journal of Translational Medicine 23: 9. 10.1186/s 12967-024-06022-9 39755616 PMC 11700453 · doi ↗ · pubmed ↗
- 7Takahashi, Kazutoshi , and Shinya Yamanaka . 2006. “Induction of Pluripotent Stem Cells From Mouse Embryonic and Adult Fibroblast Cultures by Defined Factors.” Cell 126: 663–676. 10.1016/j.cell.2006.07.024 16904174 · doi ↗ · pubmed ↗
- 8Apostolou, Panagiotis , Maria Toloudi , and Ioannis Papasotiriou . 2015. “Identification of Genes Involved in Breast Cancer and Breast Cancer Stem Cells.” Breast Cancer 7: 183–191. 10.2147/BCTT.S 85202 26203276 PMC 4507490 · doi ↗ · pubmed ↗
