# Random survival forests for the analysis of recurrent events for right-censored data, with or without a terminal event

**Authors:** Juliette Murris, Olivier Bouaziz, Michal Jakubczak, Sandrine Katsahian, Audrey Lavenu

PMC · DOI: 10.1186/s12874-025-02678-z · BMC Medical Research Methodology · 2025-11-20

## TL;DR

This paper introduces RecForest, a new method for analyzing recurring medical events using random survival forests, which outperforms existing techniques.

## Contribution

The novel contribution is adapting random survival forests for right-censored recurrent event data with or without terminal events.

## Key findings

- RecForest outperformed existing methods with C-index values between 0.60 and 0.82.
- The method showed lowest mean squared error (MSE) metrics in simulations and real data.
- RecForest is publicly available as an R package on CRAN.

## Abstract

Random survival forests (RSF) have emerged as valuable tools in medical research. They have shown their utility in modelling complex relationships between predictors and survival outcomes, overcoming linearity or low dimensionality assumptions. Nevertheless, RSF have not been adapted to right-censored data with recurrent events (RE).

This work introduces RecForest, an extension of RSF and tailored for RE data, leveraging principles from survival analysis and ensemble learning. RecForest adapts the splitting rule to account for RE, with or without a terminal event, by employing the pseudo-score test or the Wald test derived from the marginal Ghosh-Lin model. The ensemble estimate is constructed by aggregating the expected number of events from each tree. Performance metrics involve a concordance index (C-index) tailored for RE analysis, along with an extension of the mean squared error (MSE). A comprehensive evaluation was conducted on both simulated and open-source data. We compared RecForest against the non-parametric mean cumulative function and the Ghosh-Lin model.

Across the simulations and application, RecForest consistently outperforms, exhibiting C-index values ranging from 0.60 to 0.82 and lowest MSE metrics.

As analysing time-to-recurrence data is critical in medical research, the proposed method represents a valuable addition to the analytical toolbox in this domain. The RecForest implementation is publicly available as an R package on CRAN.

The online version contains supplementary material available at 10.1186/s12874-025-02678-z.

## Full-text entities

- **Diseases:** RSF (MESH:D011475), death (MESH:D003643), colorectal cancer (MESH:D015179)
- **Chemicals:** NA (MESH:D012964)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12636200/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12636200/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12636200/full.md

---
Source: https://tomesphere.com/paper/PMC12636200