Developing a Predictive System for On-Time Graduation Using Logistic Regression

Authors

  • Tety Yuliaty Universitas Katolik Parahyangan
  • Gandhi Pawitan Universitas Katolik Parahyangan

DOI:

https://doi.org/10.57152/malcom.v5i4.2142

Keywords:

Academic Prediction, Higher Education, Logistic Regression, Machine Learning, On-Time Graduation

Abstract

Timely graduation is widely recognized as a key indicator of academic quality and institutional effectiveness in higher education. While previous studies have examined individual predictors of student progression, few have combined academic, demographic, and socioeconomic factors into a comprehensive predictive model, particularly within the context of Indonesian private universities. This study aims to identify the main factors influencing on-time graduation by applying logistic regression to student data collected from a private university’s academic information system. The dataset includes 9,012 undergraduate records from cohorts entering between 2017 and 2020, covering a range of academic, admission, and background variables. The analysis reveals that fourth-semester GPA, attendance rate, scholarship status, completion of mandatory courses, and early course load have a significant impact on the probability of graduating on time. The predictive model achieved an accuracy of 85.76% and a recall of 90%, demonstrating strong classification performance. Although the findings are based on data from a single institution, the results offer practical insights for developing academic early warning systems and inform data-driven planning in higher education management.

Downloads

Download data is not yet available.

References

Moraga-Pumarino, S. Salvo-Garrido, and K. Polanco-Levicán, “Profiles of University Students Who Graduate on Time: A Cohort Study from the Chilean Context,” Behavioral Sciences, vol. 13, no. 7, p. 582, 2023, doi: 10.3390/bs13070582.

Badan Akreditasi Nasional Perguruan Tinggi (BAN-PT), Peraturan BAN-PT Nomor 5 Tahun 2024 tentang Instrumen Pemantauan dan Evaluasi Mutu Perguruan Tinggi, Jakarta, Indonesia: BAN-PT, 2024. [Online]. Available: https://lldikti6.id/wp-content/uploads/2025/03/IPE-Mutu-PT-melalui-Mekanisme-Automasi_SW27022025.pdf

J. M. Aiken, R. De Bin, M. Hjorth-Jensen, and M. D. Caballero, “Predicting Time to Graduation at a Large Enrollment American University,” PLOS ONE, vol. 15, no. 11, p. e0242334, 2020, doi: 10.1371/journal.pone.0242334.

F. Ramadhani, S. D. Panggabean, and D. Siahaan, “Faktor-Faktor yang Mempengaruhi Kelulusan Tepat Waktu Mahasiswa Program Studi Akuntansi,” J. Ilm. Akuntansi, vol. 6, no. 2, pp. 135–144, 2021, doi: 10.23887/jia.v6i2.36781.

S. Herzog, “Estimating student retention and degree-completion time: Decision trees and neural networks vis-à-vis regression,” New Directions for Institutional Research, vol. 2005, no. 125, pp. 17–33, 2005, doi: 10.1002/ir.149.

R. Chen, A. D. Simpson, and M. P. St John, “Understanding the Different Pathways to College Persistence and Degree Attainment,” The Journal of Higher Education, vol. 89, no. 1, pp. 1–38, 2018, doi: 10.1080/00221546.2017.1341757.

M. D. Putri and B. Nugroho, “Prediksi Kelulusan Mahasiswa Menggunakan Algoritma Machine Learning,” J. Tek. Sist. Komput., vol. 9, no. 1, pp. 55–60, 2021, doi: 10.14710/jtsiskom.9.1.55-60.

C. H. Yu, S. DiGangi, A. Jannasch-Pennell, and C. Kaprolet, “A data mining approach for identifying predictors of student retention from sophomore to junior year,” J. Data Sci., vol. 8, no. 2, pp. 307–325, 2010, doi: 10.6339/JDS.2010.08(2).574.

B. Nguyen, M. Boyd, and K. Henderson, “Early Prediction of Student Dropout using Machine Learning Techniques,” in Proc. 2020 IEEE Global Eng. Educ. Conf. (EDUCON), Porto, Portugal, Apr. 2020, pp. 580–586, doi: 10.1109/EDUCON45650.2020.9125311.

A. Angeioplastis, J. Aliprantis, M. Konstantakis, and A. Tsimpiris, “Predicting Student Performance and Enhancing Learning Outcomes: A Data-Driven Approach Using Educational Data Mining Techniques,” Computers, vol. 14, no. 3, p. 83, 2025, doi: 10.3390/computers14030083.

A. Hussain et al., “A Comparison of Machine Learning Models for Predicting Student Academic Performance,” IEEE Access, vol. 9, 2021, doi: 10.1109/ACCESS.2021.3093889.

B. Yadav and R. Pal, “Performance Analysis of Ensemble Models for Student Performance Prediction,” Educ. Inf. Technol., vol. 27, pp. 1–17, 2022, doi: 10.1007/s10639-022-10884-w.

B. Pardos and N. Heffernan, “Navigating the Tradeoff Between Accuracy and Interpretability in Predictive Modeling,” J. Learn. Anal., vol. 7, no. 3, pp. 1–12, 2020, doi: 10.18608/jla.2020.73.1.

M. Chen and H. Wang, “Using Explainable AI in Education,” IEEE Trans. Learn. Technol., vol. 13, no. 3, pp. 477–488, 2020, doi: 10.1109/TLT.2020.2991430.

D. Gaševi?, S. Dawson, and G. Siemens, “Let’s not forget: Learning analytics are about learning,” TechTrends, vol. 60, no. 1, pp. 64–71, 2016, doi: 10.1007/s11528-015-0014-2.

C. Romero and S. Ventura, “Educational Data Mining: A Review of the State of the Art,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 6, pp. 412–430, 2020, doi: 10.1109/TSMC.2018.2853804.

G. E. A. P. A. Batista and M. C. Monard, “An analysis of four missing data treatment methods for supervised learning,” Appl. Artif. Intell., vol. 34, no. 8, pp. 594–614, 2020, doi: 10.1080/08839514.2020.1767322.

B. Kovalerchuk and E. Triantaphyllou, “Preventing data leakage in machine learning models: pitfalls and recommendations,” SN Comput. Sci., vol. 3, no. 6, pp. 1–15, 2022, doi: 10.1007/s42979-022-01537-7.

D. Bowers, “Data Science in Education: Balancing Accuracy and Interpretability in Predictive Models,” Comput. Educ., vol. 179, p. 104415, 2022, doi: 10.1016/j.compedu.2022.104415.

C. F. Dormann et al., “Collinearity: a review of methods to deal with it and a simulation study evaluating their performance,” Ecography, vol. 36, no. 1, pp. 27–46, 2013, doi: 10.1111/j.1600-0587.2012.07348.x.

C. Molnar, Interpretable Machine Learning: A Guide for Making Black Box Models Explainable, 2nd ed., Leanpub, 2022. [Online]. Available: doi.org/10.48550/arXiv.2010.09337

M. Chen and H. Wang, “Using Explainable AI in Education,” IEEE Transactions on Learning Technologies, vol. 13, no. 3, pp. 477–488, 2020, doi: 10.1109/TLT.2020.2991430

W. Kurniawan, “Data-driven decision support system for student academic risk profiling using logistic regression,” J. Big Data, vol. 10, no. 1, pp. 1–14, 2023, doi: 10.1186/s40537-023-00752-x.

M. A. S. Pawitra, H.-C. Hung, and H. Jati, “A Machine Learning Approach to Predicting On-Time Graduation in Indonesian Higher Education,” Elinvo (Electronics, Informatics, and Vocational Education), vol. 9, no. 2, pp. 294–308, 2024, doi: 10.21831/elinvo.v9i2.77052.

M. Amri, R. Rasyid, and S. Widodo, “The development of academic early warning system using data mining approach for supporting student graduation prediction,” Journal of Physics: Conference Series, vol. 2165, no. 1, p. 012006, 2022, doi: 10.1088/1742-6596/2165/1/012006.

D.-M. Córdova-Esparza et al., “Predicting and Preventing School Dropout with Business Intelligence: Insights from a Systematic Review,” Information, vol. 16, no. 4, p. 326, 2025, doi: 10.3390/info16040326.

H. Khosravi et al., “Explainable Artificial Intelligence in Education,” Computers and Education: Artificial Intelligence, vol. 3, p. 100074, 2022, doi: 10.1016/j.caeai.2022.100074.

B. A. Schwendimann, M. J. Rodriguez-Triana, A. Vozniuk, et al., “Learning analytics dashboard: a tool for providing actionable insights to learners,” International Journal of Educational Technology in Higher Education, vol. 18, no. 1, pp. 1–17, 2021, doi: 10.1186/s41239-021-00313-7.

P. Yin and Y. Wang, “Academic risk prediction using interpretable machine learning models: A university case study,” Computers & Education, vol. 168, p. 104193, 2021, doi: 10.1016/j.compedu.2021.104193.

Downloads

Published

2025-10-30

How to Cite

Yuliaty, T., & Pawitan, G. (2025). Developing a Predictive System for On-Time Graduation Using Logistic Regression. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 5(4), 1253-1265. https://doi.org/10.57152/malcom.v5i4.2142