# Effects of similarity networks in graph-based multi-omics classification

**Authors:** Masrafe Bin Hannan Siam, Md Rayhan Khan, Md Fazla Elahe, Md Shohel Arman, Swarna Akter, Junhuang Jiang, Junhuang Jiang, Tao Huang, Tao Huang

PMC · DOI: 10.1371/journal.pone.0344754 · PLOS One · 2026-03-19

## TL;DR

This paper evaluates different ways to build similarity networks for classifying diseases using multi-omics data and finds that Cosine Similarity performs best.

## Contribution

Systematic evaluation of six similarity network strategies for multi-omics classification, revealing Cosine Similarity's superior performance.

## Key findings

- Cosine Similarity outperforms other metrics in accuracy, F1-score, and AUC for disease classification.
- Cosine Similarity shows the lowest variance across cross-validation splits.
- Simple similarity measures like Cosine Similarity capture meaningful biological patterns better than complex methods.

## Abstract

Accurate classification of disease subtypes is a fundamental requirement of precision medicine especially for complex and heterogeneous conditions such as breast cancer and Alzheimer’s disease. Recent advances in graph-based deep learning have shown strong potential in multi-omics integration by modeling inter-sample relationships through similarity networks. Yet, the question of how best to construct these networks remains an open and underexplored challenge. In this work, we present a systematic evaluation of six distinct similarity network construction strategies including Cosine Similarity, Cosine Distance, RBF-based measures, and two hybrid combinations leveraging a graph convolutional network (GCN) integrated with a view correlation discovery network (VCDN) framework for multi-omics disease classification. Using two benchmark datasets (BRCA and ROSMAP), we assessed the impact of each method on classification performance, variance across runs, and statistical robustness. Surprisingly, our results demonstrate that Cosine Similarity outperforms all other metrics, consistently achieving the highest accuracy, F1-score, and AUC, while also showing the lowest standard deviation across cross-validation splits. Despite the growing popularity of kernel-based and hybrid similarity designs, our findings highlight the unique effectiveness of simple angular similarity in capturing biologically meaningful structure in high-dimensional omics data. In our study, we showed that simple yet biologically meaningful similarity measures like Cosine Similarity can outperform more complex techniques in accuracy, consistency, and clarity. This insight sets the stage for building more effective and interpretable graph-based models to support precision medicine.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989), Alzheimer’s disease (MONDO:0004975)

## Full-text entities

- **Genes:** BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}
- **Diseases:** neurodegeneration (MESH:D019636), MOGONET (MESH:D015161), Cosine Distance (MESH:C535290), Breast Invasive Carcinoma (MESH:D001943), AD (MESH:D000544), liver cancer (MESH:D006528), Cancer (MESH:D009369), lung and colon cancer (MESH:D008175), RBF (MESH:D020425)
- **Chemicals:** GCN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13001923/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13001923/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC13001923/full.md

---
Source: https://tomesphere.com/paper/PMC13001923