# Explainable attrition risk scoring for managerial retention decisions in human resource analytics

**Authors:** M. S. Pavithran, S. M. Vadivel

PMC · DOI: 10.3389/fdata.2025.1699561 · Frontiers in Big Data · 2026-01-12

## TL;DR

This paper presents an interpretable machine learning model to predict employee attrition, helping HR managers make better retention decisions.

## Contribution

The novel contribution is a calibrated and interpretable attrition risk model using LIME, SHAP, and permutation importance for HR decision support.

## Key findings

- The Random Forest model achieved a high AUC-ROC score of 97.37% for attrition prediction.
- Calibration reduced the Brier Score from 0.03873 to 0.03480, improving prediction reliability.
- Interpretability tools like LIME and SHAP provided actionable insights into attrition risk factors.

## Abstract

Employee turnover remains a significant challenge for organizations as it becomes difficult for them to retain the same employees and continue with their operations efficiently. With the assistance of predictive analytics, HR managers will be able to foresee and lower the potential turnover. Conventional research has focused on the effectiveness of technical models, yet there is a lack of studies investigating the interpretability and reliability of managerial forecasts.

This research used the Employee Attrition dataset and applied various pre-processing methods, including label encoding, feature scaling, and SMOTE for class balancing. Machine learning models were trained and optimized using grid search with stratified cross-validation. The best-performing model was calibrated using the sigmoid method to ensure the accuracy of the predicted probabilities. LIME enabled local interpretability, thus providing practical insights into individual employee attrition-related risks. Permutation feature importance analysis and SHAP summary plots helped in better understanding the model by showing the individual features that contributed to the attrition probability.

The Random Forest classifier achieved the highest AUC-ROC score of 97.37%. Risk distribution visualizations highlight employees with the highest attrition probability, and calibration is the main reason for the Brier Score reduction from 0.03873 to 0.03480.

The study concludes that by prioritizing interventions and increasing the accuracy of retention strategies, a calibrated, interpretable, and risk-stratified model can enhance HR decision-making. This framework aids HR leaders in transitioning from reactive to proactive workforce management by leveraging data-driven insights.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12832383/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12832383/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12832383/full.md

---
Source: https://tomesphere.com/paper/PMC12832383