# Warfarin dose estimation on multiple datasets with automated   hyperparameter optimisation and a novel software framework

**Authors:** Gianluca Truda, Patrick Marais

arXiv: 1907.05363 · 2020-12-02

## TL;DR

This paper evaluates machine learning algorithms for warfarin dose prediction across multiple datasets, introduces automated hyperparameter optimization via genetic programming, and presents a new software framework to enhance research reproducibility.

## Contribution

It introduces a novel software framework and automated hyperparameter optimization method that match expert-crafted models in warfarin dose estimation.

## Key findings

- Support vector and linear regression models performed best.
- Neural networks performed poorly compared to other models.
- Automated model optimization matched human expert models.

## Abstract

Warfarin is an effective preventative treatment for arterial and venous thromboembolism, but requires individualised dosing due to its narrow therapeutic range and high individual variation. Many machine learning techniques have been demonstrated in this domain. This study evaluated the accuracy of the most promising algorithms on the International Warfarin Pharmacogenetics Consortium dataset and a novel clinical dataset of South African patients. Support vectors and linear regression were amongst the top performers in both datasets and performed comparably to recent stacked ensemble approaches, whilst neural networks were one of the worst performers in both datasets. We also introduced genetic programming to automatically optimise model architectures and hyperparameters without human guidance. Remarkably, the generated models were found to match the performance of the best models hand-crafted by human experts. Finally, we present a novel software framework (Warfit-learn) for warfarin dosing research. It leverages the most successful techniques in preprocessing, imputation, and parallel evaluation, with the goal of accelerating research and making results in this domain more reproducible.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.05363/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1907.05363/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1907.05363/full.md

---
Source: https://tomesphere.com/paper/1907.05363