# Protocol for obtaining cancer type and subtype predictions using subSCOPE

**Authors:** Jasleen K. Grewal, A. Gordon Robertson, Kyle Ellrott, Christopher K. Wong, Jordan A. Lee, Christina Yau, Bahar Tercan, Mauro A.A. Castro, Christopher C. Benz, Theo A. Knijnenburg, Theo A. Knijnenburg, Mauro A.A. Castro, Vinicius S. Chagas, Victor H. Apolonio, Verena Friedl, Joshua M. Stuart, Vladislav Uzunangelov, Christopher K. Wong, Jesper B. Andersen, Andrew D. Cherniack, Galen F. Gao, Gad Getz, Stephanie H. Hoyt, Whijae Roh, Lindsay Westlake, Christopher C. Benz, Jasleen K. Grewal, Steven J.M. Jones, A. Gordon Robertson, Samantha J. Caesar-Johnson, John A. Demchok, Ina Felau, Anab Kemal, Roy Tarnuzzer, Zhining Wang, Liming Yang, Jean C. Zenklusen, Rehan Akbani, Bradley M. Broom, Zhenlin Ju, Andre Schultz, Akinyemi I. Ojesina, Katherine A. Hoadley, Avantika Lal, Daniele Ramazzotti, Chen Wang, Alexander J. Lazar, Lewis R. Roberts, Taek-Kyun Kim, Ilya Shmulevich, Bahar Tercan, Paulos Charonyktakis, Vincenzo Lagani, Ioannis Tsamardinos, Esther Drill, Ronglai Shen, Martin L. Ferguson, Kami E. Chiotti, Kyle Ellrott, Brian J. Karlberg, Jordan A. Lee, Eve Lowenstein, Adam Struck, Paul T. Spellman, Christina Yau, Toshinori Hinoue, Peter W. Laird, Jean C. Zenklusen, Andrew D. Cherniack, Peter W. Laird, Steven J.M. Jones

PMC · DOI: 10.1016/j.xpro.2025.103705 · 2025-04-10

## TL;DR

This paper introduces a protocol using subSCOPE to classify cancer types and subtypes from various -omics data.

## Contribution

A protocol is introduced for cancer subtype prediction using subSCOPE with multiple -omics data types.

## Key findings

- subSCOPE can classify cancer subtypes using five -omics data types.
- The protocol allows selection of specific cancer types and data types for prediction.
- It provides subtype-level classification for non-TCGA cancer samples across 26 cohorts and 106 subtypes.

## Abstract

We present a protocol for obtaining cancer type and subtype predictions using a machine learning method (subSCOPE). We describe steps for data preparation, subSCOPE setup, and running subSCOPE inference on prepared data. The protocol supports five -omics data types as input (DNA methylation, gene expression, microRNA [miRNA] expression, point mutations, and copy-number variants) and allows individual cancer type and data type selection. For non-The Cancer Genome Atlas (TCGA) cancer samples, it provides subtype-level classification across 26 different TCGA cancer cohorts and 106 subtypes.

For complete details on the use and execution of this protocol, please refer to Ellrott et al.1

•Classify -omics data into one of 106 subtypes across 26 human cancers with subSCOPE•Use gene expression, miRNA, mutation, copy-number variation, or methylation data•Specify choice of data types and cancer types for prediction if desired•Obtain confidence values associated with each prediction

Classify -omics data into one of 106 subtypes across 26 human cancers with subSCOPE

Use gene expression, miRNA, mutation, copy-number variation, or methylation data

Specify choice of data types and cancer types for prediction if desired

Obtain confidence values associated with each prediction

Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics.

We present a protocol for obtaining cancer type and subtype predictions using a machine learning method (subSCOPE). We describe steps for data preparation, subSCOPE setup, and running subSCOPE inference on prepared data. The protocol supports five -omics data types as input (DNA methylation, gene expression, microRNA [miRNA] expression, point mutations, and copy-number variants) and allows individual cancer type and data type selection. For non-The Cancer Genome Atlas (TCGA) cancer samples, it provides subtype-level classification across 26 different TCGA cancer cohorts and 106 subtypes.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** Cancer (MESH:D009369)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12013717/full.md

---
Source: https://tomesphere.com/paper/PMC12013717