# GeneHarmony: A Knowledge-Based Tool for Biomarker Discovery in Disease: Sjögren’s Disease vs. Rheumatoid Arthritis and Systemic Lupus Erythematosus

**Authors:** Micaela F. Beckman, Adam Alexander, Jean-Luc C. Mougeot, Farah Bahrani Mougeot

PMC · DOI: 10.3390/ijms26136379 · International Journal of Molecular Sciences · 2025-07-02

## TL;DR

This paper introduces GeneHarmony, a tool that helps find shared genes and pathways in autoimmune diseases like Sjögren’s Disease, Rheumatoid Arthritis, and Systemic Lupus Erythematosus.

## Contribution

The novel contribution is GeneHarmony, a knowledge-based tool for analyzing gene–disease associations and identifying candidate biomarkers shared across diseases.

## Key findings

- Seven genes (CD4, CD8A, IL6, IL17A, TNFSF13B, TNF, TRIM21) were common to Sjögren’s Disease, Rheumatoid Arthritis, and Systemic Lupus Erythematosus.
- STRINGv12 analysis identified significant KEGG pathways (FDR < 0.05) involving interleukin signaling, cytokine signaling, and the immune system.
- The tool simplifies data mining for gene–disease associations and tissue-level gene expression, drawing on the Diseasesv2.0 and GTExv8 databases.

## Abstract

Sjögren’s Disease (SjD), Rheumatoid Arthritis (RA), and Systemic Lupus Erythematosus (SLE) are autoimmune diseases with overlapping genetic features, yet the etiologies of these diseases are poorly understood. Using these rheumatic diseases as an example of proof of concept, our aim was to develop a tool that simplifies analysis of gene–disease associations applicable to any disease and to perform comparisons. This tool is meant to provide insights into associated gene symbols and gene expression data to identify candidate biomarkers in common among these diseases. The Diseasesv2.0 and GTExv8 databases were utilized for data collection, providing searchable disease names, affiliated gene symbols, confidence scores (ranging from 0 to 5, with 5 being the most confident), and gene expression across the panel of 54 tissue types present in GTExv8. Data infrastructure was established on a Postgres database using Plotlyv5.17.0 and Streamlitv1.27.2 Python packages. The resulting database was used to investigate the genetic associations among SjD, RA, and SLE, including confidence scores from 2.50 to 5.00. STRINGv12 analysis determined significant pathways (FDR < 0.05). Analysis using our tool revealed the following refined gene associations for each disease: SjD based on ‘Sjogren’ search term (n = 12 genes), RA (n = 231 genes), and SLE (n = 137 genes). We found seven genes in common, namely, CD4, CD8A, IL6, IL17A, TNFSF13B, TNF, and TRIM21. With the exception of IL17A, these genes were expressed in tissue types known or suggested to be affected by SjD. STRINGv12 determined significant KEGG pathways involving interleukin signaling, cytokine signaling, and the immune system. We developed a tool that simplifies the data mining process, allowing users to search for diseases of interest and view common gene associations and gene expression. Some of the genes identified through our tool may be further explored to better understand SjD pathogenesis and systemic impact.
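The comparison described in the abstract (keep associations with a confidence score between 2.50 and 5.00, then find the genes common to every disease) can be illustrated with a minimal Python sketch. This is not GeneHarmony's code: the row layout, the function names `genes_for` and `shared_genes`, and the placeholder diseases, genes, and scores are all assumptions made for illustration, and the values are not taken from Diseasesv2.0.

```python
# Hypothetical (disease, gene_symbol, confidence) rows. Placeholder values only;
# none of these names or scores come from the Diseasesv2.0 database.
ASSOCIATIONS = [
    ("Disease A", "GENE1", 4.2),
    ("Disease A", "GENE2", 2.5),
    ("Disease A", "GENE3", 1.9),
    ("Disease B", "GENE1", 3.1),
    ("Disease B", "GENE2", 5.0),
    ("Disease B", "GENE4", 2.7),
    ("Disease C", "GENE1", 2.6),
    ("Disease C", "GENE2", 2.4),
    ("Disease C", "GENE3", 4.8),
]


def genes_for(rows, disease, min_conf=2.5, max_conf=5.0):
    """Return the gene symbols linked to one disease within the confidence range."""
    return {
        gene
        for row_disease, gene, conf in rows
        if row_disease == disease and min_conf <= conf <= max_conf
    }


def shared_genes(rows, diseases, min_conf=2.5, max_conf=5.0):
    """Return the genes that pass the confidence filter for every listed disease."""
    gene_sets = [genes_for(rows, d, min_conf, max_conf) for d in diseases]
    # Intersecting zero sets is undefined, so an empty disease list yields an empty set.
    return set.intersection(*gene_sets) if gene_sets else set()


if __name__ == "__main__":
    print(shared_genes(ASSOCIATIONS, ["Disease A", "Disease B", "Disease C"]))
```

The filter is applied per disease before intersecting, so a gene that falls below the threshold for any one disease drops out of the shared set even if it scores highly for the others.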

## Linked entities

- **Genes:** CD4 (CD4 molecule) [NCBI Gene 920], CD8A (CD8 subunit alpha) [NCBI Gene 925], IL6 (interleukin 6) [NCBI Gene 3569], IL17A (interleukin 17A) [NCBI Gene 3605], TNF (tumor necrosis factor) [NCBI Gene 7124], TRIM21 (tripartite motif containing 21) [NCBI Gene 6737]
- **Diseases:** Rheumatoid Arthritis (MONDO:0008383), Systemic Lupus Erythematosus (MONDO:0007915)

## Full-text entities

- **Genes:** CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, TRIM21 (tripartite motif containing 21) [NCBI Gene 6737] {aka RNF81, RO52, Ro/SSA, SSA, SSA1, TRIM21/Ro52}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, IL17A (interleukin 17A) [NCBI Gene 3605] {aka CTLA-8, CTLA8, IL-17, IL-17A, IL17, ILA17}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}
- **Diseases:** SjD (MESH:D012859), rheumatic diseases (MESH:D012216), RA (MESH:D001172), autoimmune diseases (MESH:D001327), SLE (MESH:D008180)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12250006/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12250006/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12250006/full.md

---
Source: https://tomesphere.com/paper/PMC12250006