Integrating SAINT with Tree-Based Models: A Case Study in Employee Attrition Prediction
Adil Derrazi, Javad Pourmostafa Roshan Sharami

TL;DR
This study evaluates integrating SAINT-generated embeddings with tree-based models for employee attrition prediction, finding that standalone tree models outperform hybrid approaches in accuracy and interpretability.
Contribution
It demonstrates that combining SAINT embeddings with tree-based models does not improve performance and reduces interpretability, challenging assumptions about hybrid model benefits.
Findings
Tree-based models outperform SAINT and hybrid models in accuracy.
Hybrid models do not improve predictive performance.
Hybrid approaches reduce model interpretability.
Abstract
Employee attrition presents a major challenge for organizations, increasing costs and reducing productivity. Predicting attrition accurately enables proactive retention strategies, but existing machine learning models often struggle to capture complex feature interactions in tabular HR datasets. While tree-based models such as XGBoost and LightGBM perform well on structured data, traditional encoding techniques like one-hot encoding can introduce sparsity and fail to preserve semantic relationships between categorical features. This study explores a hybrid approach by integrating SAINT (Self-Attention and Intersample Attention Transformer)-generated embeddings with tree-based models to enhance employee attrition prediction. SAINT leverages self-attention mechanisms to model intricate feature interactions. In this study, we explore SAINT both as a standalone classifier and as a feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
