# Supervised variable selection in randomised controlled trials prior to exploration of treatment effect heterogeneity: an example from severe malaria

**Authors:** Chieh-Hsi Wu, Chris C. Holmes

arXiv: 1901.03531 · 2019-01-14

## TL;DR

This paper advocates for supervised variable selection in RCTs to improve detection of treatment effect heterogeneity, demonstrating theoretical and practical benefits over unsupervised methods, with application to severe malaria data.

## Contribution

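The paper proposes outcome-driven (supervised) variable selection as a dimension-reduction step before testing for treatment effect heterogeneity (TEH) in RCTs. It shows that this step can maintain power without inflating the type-I error (false-positive) rate, and offers it as an alternative to the unsupervised approaches that current guidance mainly relies on.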
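To make the idea of outcome-driven screening concrete, here is a minimal sketch on simulated data. It is not the paper's procedure, which this summary does not describe: the marginal-correlation ranking, the function names `pearson` and `screen_by_outcome`, and all simulation settings are assumptions chosen for illustration. The only point it illustrates is that the screen looks at covariates and the outcome, and never at treatment assignment.

```python
import random


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def screen_by_outcome(covariates, outcome, keep):
    """Hypothetical screen: keep the `keep` covariates most correlated
    (in absolute value) with the outcome. Treatment is not used."""
    n_vars = len(covariates[0])
    scores = [
        abs(pearson([row[j] for row in covariates], outcome))
        for j in range(n_vars)
    ]
    ranked = sorted(range(n_vars), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:keep])


# Simulated trial (assumed settings): 20 covariates, of which only
# covariates 0 and 1 are related to the outcome.
rng = random.Random(0)
n_patients, n_vars = 400, 20
covariates = [[rng.gauss(0, 1) for _ in range(n_vars)] for _ in range(n_patients)]
outcome = [2.0 * row[0] + 1.5 * row[1] + rng.gauss(0, 1) for row in covariates]

selected = screen_by_outcome(covariates, outcome, keep=2)
print("covariates carried forward to interaction tests:", selected)
```

Only the covariates in `selected` would then be carried forward to treatment-by-covariate interaction tests, so the number of tests is set by `keep` rather than by the full covariate count.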

## Key findings

- Supervised variable selection before interaction testing can maintain the power to detect TEH.
- It does so without inflating the type-I error (false-positive) rate.
- Application to severe malaria data illustrates practical benefits.

## Abstract

Exploration of treatment effect heterogeneity (TEH) is an increasingly important aspect of modern statistical analysis for stratified medicine in randomised controlled trials (RCTs) as we start to gather more information on trial participants and wish to maximise the opportunities for learning from data. However, the analyst should refrain from including a large number of variables in a treatment interaction discovery stage, because doing so can significantly dilute the power to detect any true outcome-predictive interactions between treatments and covariates. Current guidance is limited and mainly relies on the use of unsupervised learning methods, such as hierarchical clustering or principal components analysis, to reduce the dimension of the variable space prior to interaction tests. In this article we show that outcome-driven dimension reduction, i.e. supervised variable selection, can maintain power without inflating the type-I error or false-positive rate. We provide theoretical and applied results to support our approach. The applied results are obtained by illustrating our framework on a dataset from an RCT in severe malaria. We also pay particular attention to the internal risk model approach for TEH discovery, which we show is a particular case of our method, and we point to improvements over the current implementation.
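The power-dilution point can be illustrated with a short calculation. The sketch below assumes a Bonferroni correction and a two-sided z-test for a single true interaction; neither choice is taken from the paper, and the function name `bonferroni_power` and the standardised effect of 3.0 are hypothetical. It shows how the chance of detecting the same true interaction shrinks as more candidate variables are tested alongside it.

```python
from statistics import NormalDist


def bonferroni_power(z_effect, n_tests, alpha=0.05):
    """Approximate power of a two-sided z-test for one true interaction
    when the familywise level `alpha` is split evenly across `n_tests`
    tests (Bonferroni). `z_effect` is the expected z-statistic."""
    std_normal = NormalDist()
    # Per-test two-sided critical value after the Bonferroni split.
    z_crit = std_normal.inv_cdf(1 - alpha / (2 * n_tests))
    # P(|Z| > z_crit) when Z ~ Normal(z_effect, 1).
    upper_tail = 1 - std_normal.cdf(z_crit - z_effect)
    lower_tail = std_normal.cdf(-z_crit - z_effect)
    return upper_tail + lower_tail


# Same true effect, increasing numbers of candidate interaction tests.
for n_tests in (1, 5, 20, 100):
    print(n_tests, round(bonferroni_power(3.0, n_tests), 3))
```

Under these assumptions, reducing the number of candidate variables before the interaction stage raises the per-test threshold `alpha / n_tests`, which is the mechanism by which dimension reduction can protect power.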

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.03531/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1901.03531/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1901.03531/full.md

---
Source: https://tomesphere.com/paper/1901.03531