# HDAnalyzeR: streamlining data analysis for biomarker research

**Authors:** Konstantinos Antonopoulos, Emil Johansson, Josefin Kenrick, Leo Dahl, Fredrik Edfors, Mathias Uhlén, María Bueno Álvez

PMC · DOI: 10.1093/bioadv/vbag020 · 2026-01-23

## TL;DR

HDAnalyzeR is an R package that simplifies and unifies the analysis of large biological datasets, improving efficiency and reproducibility in biomarker research.

## Contribution

HDAnalyzeR introduces a modular, user-friendly R package that streamlines high-dimensional biological data analysis and supports reproducible workflows.

## Key findings

- HDAnalyzeR reduced analysis time and code complexity in case studies.
- The package achieved blood cancer classification with AUC = 1.0.
- It identified thousands of solid tumor-associated genes.

## Abstract

Exploration of large-scale biological datasets remains a central challenge in computational biology. While many tools are available, they are often developed in isolation, leading to fragmented workflows, duplicated efforts, and limited reproducibility. There is a pressing need for flexible, standardized solutions that unify exploratory data analysis and biomarker discovery across diverse platforms.

We present HDAnalyzeR, a user-friendly and extensible R package for the streamlined analysis of high-dimensional biological data. HDAnalyzeR provides modular, reproducible workflows that support a range of analyses, from quality control and dimensionality reduction to differential expression and enrichment analysis. The package features built-in visualization, metadata-aware modeling, and seamless integration with interactive apps and learning resources. We also present two case studies, where HDAnalyzeR dramatically reduced analysis time and code complexity while providing biologically meaningful insights, such as classification of blood cancer types with AUC = 1.0 and identification of thousands of solid tumor-associated genes. HDAnalyzeR is designed to support both beginner users and experienced bioinformaticians, promoting transparency, reproducibility, and publication-quality output.

HDAnalyzeR is freely available both as an open-source R package at https://github.com/kantonopoulos/HDAnalyzeR and a web application at https://hdanalyzer.serve.scilifelab.se.

## Linked entities

- **Diseases:** blood cancer (MONDO:0002334)

## Full-text entities

- **Genes:** SLIT2 (slit guidance ligand 2) [NCBI Gene 9353] {aka SLIL3, Slit-2}, JAM2 (junctional adhesion molecule 2) [NCBI Gene 58494] {aka C21orf43, CD322, IBGC8, JAM-B, JAMB, PRO245}, CD22 (CD22 molecule) [NCBI Gene 933] {aka SIGLEC-2, SIGLEC2}, PRAME (PRAME nuclear receptor transcriptional regulator) [NCBI Gene 23532] {aka CT130, MAPE, OIP-4, OIP4}, DCXR (dicarbonyl and L-xylulose reductase) [NCBI Gene 51181] {aka DCR, HCR2, HCRII, KIDCR, P34H, PNTSU}, TCL1A (TCL1 family AKT coactivator A) [NCBI Gene 8115] {aka TCL1}, UMOD (uromodulin) [NCBI Gene 7369] {aka ADMCKD2, ADTKD1, FJHN, HNFJ, HNFJ1, MCKD2}, TNFRSF9 (TNF receptor superfamily member 9) [NCBI Gene 3604] {aka 4-1BB, CD137, CDw137, ILA, IMD109}, CA9 (carbonic anhydrase 9) [NCBI Gene 768] {aka CAIX, MN}, SPP1 (secreted phosphoprotein 1) [NCBI Gene 6696] {aka BNSP, BSPI, ETA-1, OPN}, PDE2A (phosphodiesterase 2A) [NCBI Gene 5138] {aka CGS-PDE, IDDPADS, PDE2A1, PED2A4, cGSPDE}, SLC6A4 (solute carrier family 6 member 4) [NCBI Gene 6532] {aka 5-HTT, 5-HTTLPR, 5HTT, HTT, OCD1, SERT}, FCRL3 (Fc receptor like 3) [NCBI Gene 115352] {aka CD307c, FCRH3, IFGP3, IRTA3, MAIA, SPAP2}
- **Diseases:** clear cell renal cell carcinoma (MESH:D002292), ovarian cancer (MESH:D010051), kidney (MESH:D007674), endometrial (MESH:D014591), immune (MESH:D007154), endometrial cancer (MESH:D016889), kidney and endometrial cancers (MESH:D007680), blood cancer (MESH:D019337), AML (MESH:D015470), chronic kidney disease (MESH:D051436), cancer (MESH:D009369), lung (MESH:D008171), MYEL (MESH:D009101), Lung cancer (MESH:D008175), CLL (MESH:D015451)
- **Chemicals:** DES (MESH:D004054)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12925248/full.md

---
Source: https://tomesphere.com/paper/PMC12925248