Can Large Language Model Predict Employee Attrition?
Xiaoye Ma, Weiheng Liu, Changyi Zhao, Liliya R. Tukhvatulina

TL;DR
This paper demonstrates that fine-tuned GPT-3.5 significantly outperforms traditional machine learning models in predicting employee attrition, offering deeper insights and higher accuracy for HR management.
Contribution
It introduces the use of a fine-tuned GPT-3.5 model for employee attrition prediction and compares its performance to traditional ML classifiers.
Findings
GPT-3.5 achieved an F1-score of 0.92
Traditional models like SVM reached an F1-score of 0.82
LLMs can reveal deeper behavioral patterns in HR data
Abstract
Employee attrition poses significant costs for organizations, with traditional statistical prediction methods often struggling to capture modern workforce complexities. Machine learning (ML) advancements offer more scalable and accurate solutions, but large language models (LLMs) introduce new potential in human resource management by interpreting nuanced employee communication and detecting subtle turnover cues. This study leverages the IBM HR Analytics Attrition dataset to compare the predictive accuracy and interpretability of a fine-tuned GPT-3.5 model against traditional ML classifiers, including Logistic Regression, k-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, Random Forest, AdaBoost, and XGBoost. While traditional models are easier to use and interpret, LLMs can reveal deeper patterns in employee behavior. Our findings show that the fine-tuned GPT-3.5…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI and HR Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Layer Normalization · Adam · Attention Dropout · Multi-Head Attention · Residual Connection
