# calibmsm: An R package for calibration plots of the transition probabilities in a multistate model

**Authors:** Alexander Pate, Matthew Sperrin, Richard D. Riley, Ben Van Calster, Glen P. Martin, Md Rahaman Khan, Md Rahaman Khan, Md Rahaman Khan

PMC · DOI: 10.1371/journal.pone.0320504 · PLOS One · 2025-06-04

## TL;DR

The calibmsm R package helps assess how well multistate survival models predict transitions between health states, such as recovery or death, in clinical settings.

## Contribution

calibmsm introduces a novel R package for evaluating the calibration of transition probabilities in multistate models using three distinct methods.

## Key findings

- Predictions of relapse at the time of transplant are poorly calibrated.
- Predictions of death are well calibrated, but predictions at 100 days post-transplant show poor calibration.
- The package supports calibration assessment using landmarking and pseudo-values.

## Abstract

Multistate models, which allow the prediction of complex multistate survival processes such as multimorbidity, or recovery, relapse and death following treatment for cancer, are being used for clinical prediction. It is paramount to evaluate the calibration (as well as other metrics) of a risk prediction model before implementation of the model. While there are a number of software applications available for developing multistate models, currently no software exists to aid in assessing the calibration of a multistate model, and as a result evaluation of model performance is uncommon. calibmsm has been developed to fill this gap.

Assessing the calibration of predicted transition probabilities between any two states is made possible through three approaches. The first two utilise calibration techniques for binary and multinomial logistic regression models in combination with inverse probability of censoring weights, whereas the third utilises pseudo-values. All methods are implemented in conjunction with landmarking to allow calibration assessment of predictions made at any time beyond the start of follow up. This study focuses on calibration curves, but the methodological framework also allows estimation of calibration slopes and intercepts.

This article serves as a guide on how to use calibmsm to assess the calibration of any multistate model, via a comprehensive example evaluating a model developed to predict recovery, adverse events, relapse and survival in patients with blood cancer after a transplantation. The calibration plots indicate that predictions of relapse made at the time of transplant are poorly calibrated, however predictions of death are well calibrated. The calibration of all predictions made at 100 days post transplant appear to be poor, although a larger validation sample is required to make stronger conclusions.

calibmsm is an R package which allows users to assess the calibration of predicted transition probabilities from a multistate model. Evaluation of model performance is a key step in the pathway to model implementation, yet evaluation of the performance of predictions from multistate models is not common. We hope availability of software will help model developers evaluate the calibration of models being developed.

## Linked entities

- **Diseases:** blood cancer (MONDO:0002334)

## Full-text entities

- **Diseases:** death (MESH:D003643), cancer (MESH:D009369), blood cancer (MESH:D019337)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12136353/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12136353/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/PMC12136353/full.md

---
Source: https://tomesphere.com/paper/PMC12136353