# Explainable Artificial Intelligence Unravels the Possible Distinct Roles of VKORC1 and CYP2C9 in Predicting Warfarin Anticoagulation Control

**Authors:** Kannan Sridharan, Gowri Sivaramakrishnan

PMC · DOI: 10.3390/medsci14010156 · 2026-03-22

## TL;DR

This study uses explainable AI to better understand how genetic factors like VKORC1 and CYP2C9 influence warfarin treatment effectiveness.

## Contribution

The study introduces XAI techniques to interpret ML predictions in warfarin pharmacogenomics, highlighting distinct roles of VKORC1 and CYP2C9.

## Key findings

- XGBoost and Random Forest models showed comparable accuracy in predicting poor anticoagulation control.
- SHAP analysis revealed VKORC1 provided stable risk signals while CYP2C9 caused prediction discordance.
- XAI improved model transparency, attributing predictions to genetic and clinical features.

## Abstract

Background: Warfarin pharmacogenomics is critical due to its narrow therapeutic index and significant interpatient variability. While machine learning (ML) can predict anticoagulation control status (ACS), its “black-box” nature limits clinical translatability. Explainable Artificial Intelligence (XAI) addresses this by providing interpretable insights. This study applied ML and XAI to a warfarin pharmacogenomic dataset to predict poor ACS and explain model decisions. Methods: A post hoc analysis was conducted on a cross-sectional dataset of 232 patients receiving warfarin for ≥6 months. Data included age, gender, interacting drugs, SAMe-TT2R2 score, and genotypes for CYP2C9, VKORC1, and CYP4F2. Poor ACS was defined as time in therapeutic range (TTR) < 70%. The dataset was split into training (70%) and testing (30%) cohorts. Three models, Random Forest, XGBoost, and Logistic Regression, were developed and evaluated using AUC-ROC, sensitivity, and specificity. XAI techniques, including permutation importance and SHapley Additive exPlanations (SHAP), were employed for global and local interpretability. Results: Of 232 patients, 141 (60.8%) had poor ACS. XGBoost and Random Forest demonstrated comparable predictive accuracy (AUC-ROC: 0.67), outperforming Logistic Regression. Sensitivity was 0.83 and 0.79 for XGBoost and Random Forest, respectively. However, specificity was modest for both ensemble methods (Random Forest: 0.48; XGBoost: 0.41) and extremely low for Logistic Regression (0.04), indicating poor discrimination, particularly for identifying patients with adequate anticoagulation control. Globally, important predictors included the age, SAMe-TT2R2 score, CYP2C9 (*2/*2), female gender, and VKORC1 (C/T). XAI revealed predictions were primarily driven by VKORC1, CYP4F2, SAMe-TT2R2 scores, and drug interactions. Concordance between XAI predictions and actual ACS was 78% for adequate and 88.6% for poor ACS. SHAP analysis showed VKORC1 provided a stable risk signal (mean absolute SHAP: 1.44 ± 0.49 in concordant cases), while CYP2C9 was a high-variance, high-impact driver of discordance (mean SHAP: 3.44 ± 3.79 in discordant cases). Conclusions: ML models, particularly ensemble methods, show modest ability to predict poor warfarin control with limited ability to correctly identify patients with adequate control from our dataset. XAI transforms these models into interpretable tools, with SHAP analysis attributing predictions to specific genetic and clinical features. While predictive accuracy remains modest, this approach enhances transparency and provides a foundation for generating hypotheses that may ultimately support clinical decision-making in pharmacogenomic-guided warfarin therapy.

## Linked entities

- **Genes:** VKORC1 (vitamin K epoxide reductase complex subunit 1) [NCBI Gene 79001], CYP2C9 (cytochrome P450 family 2 subfamily C member 9) [NCBI Gene 1559], CYP4F2 (cytochrome P450 family 4 subfamily F member 2) [NCBI Gene 8529]

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, CYP4F2 (cytochrome P450 family 4 subfamily F member 2) [NCBI Gene 8529] {aka CPF2}, CYP2C9 (cytochrome P450 family 2 subfamily C member 9) [NCBI Gene 1559] {aka CPC9, CYP2C, CYP2C10, CYPIIC9, P450-2C9, P450IIC9}, VKORC1 (vitamin K epoxide reductase complex subunit 1) [NCBI Gene 79001] {aka EDTP308, MST134, MST576, VKCFD2, VKOR}
- **Diseases:** XAI (MESH:C538243), bleeding (MESH:D006470), injury to (MESH:D014947), ACS (MESH:C536683), bleeding complications (MESH:D008107), thrombotic disorders (MESH:D013927), hypersensitivity (MESH:D004342), reactions (MESH:D006967)
- **Chemicals:** phenytoin (MESH:D010672), valproic acid (MESH:D014635), amiodarone (MESH:D000638), carbamazepine (MESH:D002220), SAMe (MESH:D012436), vitamin K (MESH:D014812), Warfarin (MESH:D014859)
- **Species:** Nicotiana tabacum (American tobacco, species) [taxon 4097], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** rs9923231, rs1057910, rs2108622, rs1799853, C/T

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13027873/full.md

---
Source: https://tomesphere.com/paper/PMC13027873