Implementation of Machine Learning to Predict The Timeliness of Graduation of Employees on Study Assignment at Company X
DOI:
https://doi.org/10.57152/malcom.v6i2.2474Keywords:
Energy Transition, Graduation Timeliness, Educational Data Mining, Machine Learning, Study AssignmentAbstract
The energy transition requires workers in the energy sector who have relevant skills that can be applied in the future. Company X implements a study assignment program to improve its employees' skills, but delays in completing their studies hinder their readiness to enter the workforce. Identifying the factors that influence graduation timeliness can improve the program's effectiveness. This study aims to develop a predictive model to determine whether employees in Company X's work-study program will graduate on time. The main purpose of this model is to provide early warnings about employees at risk of delays, enabling more targeted interventions to improve human resource management. We applied the CRISP-DM framework and used Machine Learning to analyze data from 317 employees who participated in the study program. Four machine learning algorithms were tested, namely Gradient Boosting, Decision Tree, Random Forest, and Naive Bayes. 17 factors were trained to cover academic, demographic, and administrative aspects to predict timely graduation. Among the algorithms tested, Gradient Boosting showed the best performance with an AUC of 0.956 and an accuracy of 0.909. These results were supported by high ROC and confusion matrix values, indicating the model's excellent predictive ability.
Downloads
References
A. Tharwat, “Classification assessment methods,” in Applied Computing and Informatics, vol. 17, no. 1, pp. 168–192, 2021. doi: 10.1016/j.aci.2018.08.003.
A. Margherita, “Human resources analytics: A systematization of research topics and directions for future research,” in Human Resource Management Review, vol. 32, no. 2, 100795, 2022. doi: 10.1016/j.hrmr.2020.100795.
C. Bentéjac, A. Csörg?, and G. Martínez-Muñoz, “A comparative analysis of gradient boosting algorithms,” in Artificial Intelligence Review, vol. 54, pp. 1937–1967, 2021. doi: 10.1007/s10462-020-09896-5.
C. Romero and S. Ventura, “Educational Data Mining and Learning Analytics,” in WIREs Data Mining and Knowledge Discovery, 2020. doi: 10.1002/widm.1355.
C. Schröer et al., “A Systematic Review on Applying CRISP-DM Process Model,” in Procedia Computer Science, vol. 181, pp. 526–534, 2021. doi: 10.1016/j.procs.2021.01.199.
D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,” in BMC Genomics, vol. 21, 2020. doi: 10.1186/s12864-019-6413-7.
D. Minbaeva, “Disrupted HR?,” in Human Resource Management Review, vol. 31, no. 4, 100820, 2021. doi: 10.1016/j.hrmr.2020.100820.
E. Richardson et al., “The receiver operating characteristic curve accurately assesses imbalanced datasets,” in Patterns, vol. 5, no. 6, 100994, 2024. doi: 10.1016/j.patter.2024.100994.
F. Lukitasari, T. Wrahatnolo, A. Wardhono, A. Yushila, H. Muyasyaroh, A. Azizah, M. Shahuddin, and L. Nurlaela, “Employability Skills: A Systematic Literature Review on Skills, Challenges, and Curriculum Integration in Higher Education,” in E3S Web of Conferences, vol. 645, 06007, 2025. doi: 10.1051/e3sconf/202564506007.
F. Martínez-Plumed et al., “CRISP-DM Twenty Years Delayedr: From Data Mining Processes to Data Science Trajectories,” in IEEE Transactions on Knowledge and Data Engineering, 2021. doi: 10.1109/TKDE.2019.2962680.
F. Soheli, “Classification of Academic Performance for University Research Evaluation by Implementing Modified Naive Bayes Algorithm,” in Procedia Computer Science, vol. 194, pp. 224–228, 2021. doi: 10.1016/j.procs.2021.10.077.
International Energy Agency, World Energy Employment 2024, IEA, Paris, 2024. [Online]. Available: https://www.iea.org/reports/world-energy-employment-2024.
International Renewable Energy Agency, Renewable Energy and Jobs: Annual Review 2023, IRENA, Abu Dhabi, 2023. [Online]. Available: https://www.irena.org/Publications/2023/Sep/Renewable-Energy-and-Jobs-Annual-Review-2023
J. Li, “Area under the ROC Curve has the most consistent evaluation for binary classification,” in PLOS ONE, vol. 19, no. 12, e0316019, 2024. doi: 10.1371/journal.pone.0316019.
K. Nahar et al., “Mining educational data to predict students’ performance,” in Education and Information Technologies, vol. 26, pp. 6051–6067, 2021. doi: 10.1007/s10639-021-10575-3.
M. Klonowska, “Human Capital and the Sustainable Energy Transition: A Socio-Economic Perspective,” in Sustainability, vol. 17, 10710, 2025. doi: 10.3390/su172310710.
M. Kuhn and K. Johnson, Feature Engineering and Selection: A Practical Approach for Predictive Models, 1st ed., Chapman and Hall/CRC, 2019. doi: 10.1201/9781315108230.
M. Ya?c?, “Educational data mining: Prediction of students’ academic performance using machine learning algorithms,” in Smart Learning Environments, vol. 9, no. 11, 2022. doi: 10.1186/s40561-022-00192-z.
P. Muthukrishnan et al., “Key Factors Influencing Graduation on Time Among Postgraduate Students: A PLS-SEM Approach,” in Asian Journal of University Education, vol. 18, no. 1, p. 51, 2022. doi: 10.24191/ajue.v18i1.17169.
S. O. Oppong, “Predicting Students’ Performance Using Machine Learning Algorithms: A Review,” in Asian Journal of Research in Computer Science, vol. 16, no. 3, pp. 128–148, 2023. doi: 10.9734/AJRCOS/2023/v16i3351.
S. Raschka, “Model evaluation, model selection, and algorithm selection in machine learning,” in arXiv preprint arXiv:1811.12808, 2018.
T. Abd El-Hafeez and A. Omar, “Student Performance Prediction Using Machine Learning Techniques,” 2022. doi: 10.21203/rs.3.rs-1455610/v1.
W. Zhang, Y. Wang, and S. Wang, “Predicting academic performance using tree-based machine learning models: A case study of bachelor students in an engineering department in China,” in Education and Information Technologies, vol. 27, no. 9, pp. 13051–13066, 2022. doi: 10.1007/s10639-022-11170-w.
World Economic Forum, The Future of Jobs Report 2023, World Economic Forum, Geneva, Switzerland, 2023. [Online]. Available: https://www.weforum.org/reports/the-future-of-jobs-report-2023.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Parenda Rizkya Permata, Imam Yuadi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Copyright © by Author; Published by Institut Riset dan Publikasi Indonesia (IRPI)
This Indonesian Journal of Machine Learning and Computer Science is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

















