# SeOMLR: one-step multi-view latent representation with self-weighted ensemble learning for multi-omics cancer subtyping

**Authors:** Wenjing Song, Yesen Sun, Le Ou-Yang

PMC · DOI: 10.1093/bioinformatics/btag074 · Bioinformatics · 2026-03-05

## TL;DR

This paper introduces seOMLR, a new method for cancer subtyping that improves accuracy by better integrating multi-omics data.

## Contribution

The novel one-step framework seOMLR enhances cancer subtyping by combining latent representation and self-weighted ensemble learning.

## Key findings

- seOMLR outperforms existing methods in subtyping accuracy on TCGA datasets.
- The method avoids information loss by using spectral rotation for clustering.
- Experiments show efficient multi-omics data fusion with improved stability.

## Abstract

Accurate cancer subtyping is critically important for cancer treatment due to significant molecular heterogeneity. While existing methods with multi-omics integration have achieved some success in cancer subtype identification by leveraging the rich information provided by multi-omics data, most approaches remain limited by an overemphasis on cross-omics consistency at the expense of intra-omics specificity. Furthermore, a two-step scheme is often adopted to extract cluster structure from a consistency matrix or a continuous indicator matrix by k-means, which inevitably leads to information loss and unstable clusters.

To overcome these issues, we propose seOMLR, a one-step multi-view latent representation method with self-weighted ensemble learning for cancer subtyping. Using relaxed exclusivity constraints and consistency regularization terms, seOMLR exploits the specificity and consistency of multi-omics data by building a sparse low-rank self-representation framework. Simultaneously, a self-weighted ensemble strategy is introduced to adaptively incorporate prior subtyping information from other methods, indirectly promoting specificity and consistency learning. Moreover, the discrete clustering structure is subsequently extracted via spectral rotation to avoid information loss and cluster instability. Through joint iterative optimization of fusion and clustering, seOMLR enhances subtyping accuracy. Experiments on both simulated datasets and eight real multi-omics cancer datasets from TCGA demonstrate that seOMLR outperforms competing methods, achieving efficient multi-omics data fusion and providing computational framework support for cancer subtyping research.

Supplementary data are available at Bioinformatics online.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12980331/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12980331/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12980331/full.md

---
Source: https://tomesphere.com/paper/PMC12980331