# Machine Learning in Biomarker-Driven Precision Oncology: Automated Immunohistochemistry Scoring and Emerging Directions in Genitourinary Cancers

**Authors:** Matthew Yap, Ioana-Maria Mihai, Gang Wang

PMC · DOI: 10.3390/curroncol33010031 · 2026-01-06

## TL;DR

This paper reviews how machine learning improves the consistency of immunohistochemistry scoring in cancer diagnosis and explores its potential in genitourinary cancers.

## Contribution

The paper highlights the application of machine learning to emerging biomarkers in genitourinary cancers, supporting precision oncology.

## Key findings

- Machine learning improves consistency and scalability in scoring established biomarkers like ER/PR and PD-L1.
- ML algorithms show early success in quantifying emerging biomarkers in genitourinary cancers, linking them to clinical outcomes.
- Challenges include limited training data and variability in staining protocols.

## Abstract

Immunohistochemistry (IHC) is a common test used by pathologists to detect cancer biomarkers, which can help with diagnosis, prognosis, and treatment selection. However, IHC results can vary between laboratories and between observers. New digital pathology tools and artificial intelligence (AI), particularly machine learning (ML) techniques, can analyse stained tissue more consistently. This review gives an overview of how ML is being used to automate IHC scoring, first in well-studied biomarkers and then emerging biomarkers. This review then explores how these innovations can apply to genitourinary (GU) oncology, including prostate, renal, and bladder tumours, for which researchers have begun applying ML to new biomarkers that may predict outcomes or treatment response. ML use in IHC scoring is promising but requires more validation.

Immunohistochemistry (IHC) is essential for diagnostic, prognostic, and predictive biomarker assessment in oncology, but manual interpretation is limited by subjectivity and inter-observer variability. Machine learning (ML), a computational subset of AI that allows algorithms to recognise patterns and learn from annotated datasets to make predictions or decisions, has led to advancements in digital pathology by supporting automated quantification of biomarker expression on whole-slide images (WSIs). This review evaluates the role of ML-assisted IHC scoring in the transition from validated biomarkers to the discovery of emerging prognostic and predictive IHC biomarkers for genitourinary (GU) tumours. Current applications include ML-based scoring of routinely used biomarkers such as ER/PR, HER2, mismatch repair (MMR) proteins, PD-L1, and Ki-67, demonstrating improved consistency and scalability. Emerging studies in GU cancers show that algorithms can quantify markers including androgen receptor (AR), PTEN, cytokeratins, Uroplakin II, Nectin-4 and immune checkpoint proteins, with early evidence indicating associations between ML-derived metrics and clinical outcomes. Important limitations remain, including limited availability of training datasets, variability in staining protocols, and regulatory challenges. Overall, ML-assisted IHC scoring is a reproducible and evolving approach that may support biomarker discovery and enhance precision GU oncology.

## Linked entities

- **Proteins:** EREG (epiregulin), PGR (progesterone receptor), ERBB2 (erb-b2 receptor tyrosine kinase 2), CD274 (CD274 molecule), Mki67 (antigen identified by monoclonal antibody Ki 67), PTEN (phosphatase and tensin homolog), NECTIN4 (nectin cell adhesion molecule 4)
- **Diseases:** cancer (MONDO:0004992), renal tumours (MONDO:0021163), bladder tumours (MONDO:0004987)

## Full-text entities

- **Genes:** EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}, NECTIN4 (nectin cell adhesion molecule 4) [NCBI Gene 81607] {aka EDSS1, LNIR, PRR4, PVRL4, nectin-4}, UPK2 (uroplakin 2) [NCBI Gene 7379] {aka UP2, UPII}, CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, AR (androgen receptor) [NCBI Gene 367] {aka AIS, AR8, DHTR, HPCX3, HUMARA, HYSP1}, PTEN (phosphatase and tensin homolog) [NCBI Gene 5728] {aka 10q23del, BZS, CWS1, DEC, GLM2, MHAM}
- **Diseases:** GU cancers (MESH:D014565)

---
Source: https://tomesphere.com/paper/PMC12840502