# An effective encoding of human medical conditions in disease space provides a versatile framework for deciphering disease associations

**Authors:** Tianxin Xu, Yu Li, Xin Gao, Andrey Rzhetsky, Gengjie Jia

PMC · DOI: 10.1002/qub2.93 · Quantitative Biology · 2025-03-11

## TL;DR

This paper presents disease embedding, which maps diseases to vectors in a shared high-dimensional space, as a unified framework for analyzing disease associations and comorbidities.

## Contribution

The paper takes a systems biology approach in which embedding algorithms map diseases into a high-dimensional space as vectors, so that the similarity between any pair of diseases can be measured quantitatively and used to analyze disease associations.

## Key findings

- Disease embeddings enable quantitative measurement of disease similarity in a high-dimensional space.
- The framework supports downstream analyses like discovering hidden disease associations and improving genetic parameter estimation.
- The method facilitates data-driven disease classification and genetic association studies that account for comorbidities.

## Abstract

It is challenging to identify comorbidity patterns and mechanistically investigate disease associations based on health‐related data that are often sparse, large‐scale, and multimodal. Adopting a systems biology approach, embedding‐based algorithms provide a new perspective to examine diseases under a unified framework by mapping diseases into a high‐dimensional space as embedding vectors. These vectors and their constituted disease space encode pathological information and enable a quantitative and systemic measurement of the similarity between any pair of diseases, opening up an avenue for numerous types of downstream analyses. Here, we exemplify its potential through applications in discovering hidden disease associations, assisting in genetic parameter estimation, facilitating data‐driven disease classifications, and transforming genetic association studies of diseases in consideration of comorbidities. While underscoring the power and versatility of this approach, we also discuss the challenges posed by medical context, requirements of online training and result validation, and research opportunities in constructing foundation models from multimodal disease data. With continued innovation and exploration, disease embedding has the potential to transform the fields of disease association analysis and even pathology studies by providing a holistic representation of patient health status.
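
The abstract's idea of measuring similarity between any pair of diseases in an embedding space can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the abstract does not name a similarity metric (cosine similarity is used as one common choice), the 4-dimensional vectors are made-up values rather than embeddings from the paper, and `cosine_similarity` and `rank_neighbors` are hypothetical helper names.

```python
import math

# Hypothetical toy embeddings; the values are invented for illustration
# and do not come from the paper or any trained model.
EMBEDDINGS = {
    "type 2 diabetes": [0.9, 0.1, 0.3, 0.0],
    "obesity": [0.8, 0.2, 0.4, 0.1],
    "migraine": [0.0, 0.9, 0.1, 0.5],
}


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def rank_neighbors(query, embeddings):
    """Rank every other disease by its similarity to the query disease."""
    query_vec = embeddings[query]
    scores = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in embeddings.items()
        if name != query
    ]
    # Most similar disease first.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, score in rank_neighbors("type 2 diabetes", EMBEDDINGS):
        print(f"{name}: {score:.3f}")
```

With these toy vectors, "obesity" ranks above "migraine" as a neighbor of "type 2 diabetes"; with real embeddings, such a ranking would be a candidate list of associations to validate, not a result in itself.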

## Full-text entities

- **Diseases:** diseases (MESH:D004194)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12806074/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12806074/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC12806074/full.md

---
Source: https://tomesphere.com/paper/PMC12806074