# Integrating Textual Features with Survival Analysis for Predicting Employee Turnover

**Authors:** Qian Ke, Yongze Xu

PMC · DOI: 10.3390/bs16020174 · Behavioral Sciences · 2026-01-26

## TL;DR

This paper introduces a method combining text analysis and survival modeling to predict employee turnover, improving accuracy and providing interpretable insights for HR strategies.

## Contribution

The novel integration of Transformer-based text analysis with survival modeling for employee turnover prediction.

## Key findings

- Combining textual and demographic features improved prediction performance with a 3.38% increase in C-index and 3.43% in AUC.
- Transformer-based methods outperformed traditional approaches in capturing employee sentiments.
- Survival analysis enhanced model adaptability and provided interpretable risk factors for turnover.

## Abstract

This study presents a novel methodology that integrates Transformer-based textual analysis from professional networking platforms with traditional demographic variables within a survival analysis framework to predict turnover. Using a dataset comprising 4087 work events from Maimai (a leading professional networking platform in China) spanning 2020 to 2022, our approach combines sentiment analysis and deep learning semantic representations to enhance predictive accuracy and interpretability for HR decision-making. Methodologically, we adopt a hybrid feature-extraction strategy combining theory-driven methods (sentiment analysis and TF-IDF) with a data-driven Transformer-based technique. Survival analysis is then applied to model time-dependent turnover risks, and we compare multiple models to identify the most predictive feature sets. Results demonstrate that integrating textual and demographic features improves prediction performance, specifically increasing the C-index by 3.38% and the cumulative/dynamic AUC by 3.43%. The Transformer-based method outperformed traditional approaches in capturing nuanced employee sentiments. Survival analysis further boosts model adaptability by incorporating temporal dynamics and also provides interpretable risk factors for turnover, supporting data-driven HR strategy formulation. This research advances turnover prediction methodology by combining text analysis with survival modeling, offering small and medium-sized enterprises a practical, data-informed approach to workforce planning. The findings contribute to broader labor market insights and can inform both organizational talent retention strategies and related policy-making.

## Full-text entities

- **Diseases:** NULL (MESH:C564833), MDI (MESH:D009123), IBS (MESH:D000081042), injury to (MESH:D014947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12938496/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12938496/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12938496/full.md

---
Source: https://tomesphere.com/paper/PMC12938496