Predicting Employee Turnover Using Machine Learning Models Trained on SAP SuccessFactors and SAP HCM Historical Data

Abstract

Employee turnover is a persistent challenge for HR departments, especially within large organizations that utilize complex enterprise systems like SAP SuccessFactors and SAP HCM. This study presents a machine learning-based predictive framework to identify potential employee exits before they occur, leveraging historical HR data spanning performance, compensation, demographic, and behavioral metrics. By training and validating various machine learning models—including Random Forest, Gradient Boosting, and Neural Networks—on anonymized employee datasets extracted from SAP modules, we aim to uncover patterns and leading indicators of voluntary and involuntary turnover. The methodology incorporates data preprocessing, feature selection, class balancing, and model interpretability strategies such as SHAP values. Our results demonstrate that the Random Forest model achieved the highest accuracy at 86%, with critical predictors being low engagement scores, lack of internal mobility, and stagnant compensation growth. The study concludes by offering a framework for proactive retention strategies and outlines implications for integrating AI-driven insights directly into HR workflows. These findings contribute to the evolving practice of predictive HR analytics and establish a replicable pipeline for real-time turnover forecasting using enterprise resource data.

Country : USA

1 Manoj Parasa2 Sasi Kiran Parasa

  1. USA
  2. USA

IRJIET, Volume 5, Issue 12, December 2021 pp. 102-106

doi.org/10.47001/IRJIET/2021.512020

References

  1. Bassi, L. (2011). Raging debates in HR analytics. People and Strategy, 34(2), 14-18.
  2. Hausknecht, J. P., & Holwerda, J. A. (2013). When employees lack options: An empirical examination of job embeddedness and turnover. Journal of Applied Psychology, 98(3), 392–412.
  3. Kaur, P., & Chahal, R. (2018). A systematic review on employee attrition prediction. International Journal of Computer Applications, 179(9), 9–16.
  4. SHRM Foundation. (2016). Using Workforce Analytics for Competitive Advantage. SHRM.
  5. Hom, P. W., & Griffeth, R. W. (1995). Employee turnover. South-Western College Publishing.
  6. Ahmed, A., & Naser, M. (2020). Predicting employee attrition using XGBoost. IEEE Access, 8, 225423-225431.
  7. Saputra, R. F., et al. (2021). Gradient boosting decision trees in HR analytics. International Journal of Data Science, 6(1), 34–41.
  8. Jain, N., & Ranjan, R. (2020). Employee turnover prediction using decision trees. International Conference on Computing, Communication and Security, 112-117.
  9. Stone, D. L., & Deadrick, D. L. (2015). Challenges and opportunities affecting the future of human resource management. Human Resource Management Review, 25(2), 139-145.
  10. Tursunbayeva, A., et al. (2017). Human resource information systems in health care: A systematic review. Human Resource for Health, 15, 8.
  11. Lundberg, S., & Lee, S. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 4765–4774.
  12. Allen, D. G., &Shanock, L. R. (2013). Perceived organizational support and embeddedness. Journal of Applied Psychology, 98(6), 1046–1057.
  13. Mitchell, T. R., et al. (2001). Why people stay: Using job embeddedness to predict voluntary turnover. Academy of Management Journal, 44(6), 1102–1121.
  14. McEvoy, G. M., & Cascio, W. F. (1987). Do good or poor performers leave? Academy of Management Journal, 30(4), 744–762.
  15. Ribeiro, M. T., et al. (2016). "Why should I trust you?": Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD, 1135–1144.