STUDENT GRADUATION PREDICTION USING DECISION TREE ALGORITHM WITH CRISP-DM METHOD (CASE STUDY: ITB AHMAD DAHLAN)
Abstract
On-time graduation is an important indicator of higher education effectiveness; however, delays in student graduation are still observed at ITB Ahmad Dahlan Jakarta. This study develops a student graduation prediction system using the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology and the Decision Tree algorithm based on historical academic data. The model was built through six CRISP-DM stages, including problem understanding, data preparation, modeling, and evaluation. Testing results indicate high performance with an Accuracy of 97.44%, Precision of 97.14%, Recall of 100%, and F1-Score of 98.55%. This system has the potential to support strategic decision-making to enhance academic quality through data-driven approaches.
Full text article
References
Aregbeshola, A. R., & Adekunle, I. A. (2024). Demystifying colonialism and migration: An African perspective. Research in Globalization, 9, 100262. https://doi.org/10.1016/j.resglo.2024.100262
Ayteki?n, A. (2025). Determining the ideal integrated performance measurement approach for eco-entrepreneurship in manufacturing sectors. Journal of Cleaner Production, 528, 146758. https://doi.org/10.1016/j.jclepro.2025.146758
Bokrantz, J., Subramaniyan, M., & Skoogh, A. (2023). Realising the promises of artificial intelligence in manufacturing by enhancing CRISP-DM. Production Planning & Control, 34(15–16), 2234–2254. https://doi.org/10.1080/09537287.2023.2234882
Castro, C., Leiva, V., & Basso, F. (2025). A Data-Driven Systematic Review of the Metaverse in Transportation: Current Research, Computational Modeling, and Future Trends. CMES - Computer Modeling in Engineering and Sciences, 144(2), 1481–1543. https://doi.org/10.32604/cmes.2025.067992
Darmawan, A. K., Yudhisari, I., Anwari, A., & Makruf, M. (2023). Pola Prediksi Kelulusan Siswa Madrasah Aliyah Swasta dengan Support Vector Machine dan Random Forest. Jurnal Minfo Polgan, 12(2). https://doi.org/10.33395/jmp.v12i2.12388
de Andrés-Sánchez, J., Belzunegui-Eraso, A., Pastor Gosálbez, I., & Sánchez-Aragón, A. (2024). A cross-sectional assessment of the influence of information sources about substance use in adolescents’ tobacco prevalence. Heliyon, 10(19), e38976. https://doi.org/10.1016/j.heliyon.2024.e38976
Du, J., & Zhu, J. (2025). A decision support model for nursing chair design driven by patent literature analysis and trapezoidal fuzzy AHP. International Journal of Industrial Ergonomics, 110, 103811. https://doi.org/10.1016/j.ergon.2025.103811
Fitriyah, L. A., Wijayadi, A. W., & Hayati, N. (2020). Efikasi Diri, Kestabilan Emosi dan Keberhasilan Akademik Mahasiswa dalam Perkuliahan. DWIJA CENDEKIA: Jurnal Riset Pedagogik, 4(1), 44–51.
Hamid, F., & Roy, T. (2025). Unveiling Sociocultural Barriers to Breast Cancer Awareness Among the South Asian Population: Case Study of Bangladesh and West Bengal, India. JMIR Human Factors, 12. https://doi.org/10.2196/53969
Kar, M., Sadhukhan, S., & Parida, M. (2024). User satisfaction-based prioritisation of attributes influencing walk accessibility to metro stations: A multi-attribute decision making approach. Case Studies on Transport Policy, 17, 101255. https://doi.org/10.1016/j.cstp.2024.101255
Khan, A. U., Ma, Z., Li, M., Hu, W., Khan, M. N., Sohu, J. M., & Aziz, F. (2024). Beyond bookshelves, how 5/6G technology will reshape libraries: Two-stage SEM and SF-AHP analysis. Technology in Society, 78, 102629. https://doi.org/10.1016/j.techsoc.2024.102629
Li, H., Zhang, Z., Li, T., & Si, X. (2024). A review on physics-informed data-driven remaining useful life prediction: Challenges and opportunities. Mechanical Systems and Signal Processing, 209, 111120. https://doi.org/10.1016/j.ymssp.2024.111120
Liu, M., Zhang, H., Xu, Z., & Ding, K. (2024). The fusion of fuzzy theories and natural language processing: A state-of-the-art survey. Applied Soft Computing, 162, 111818. https://doi.org/10.1016/j.asoc.2024.111818
Ma, R., Wang, X., Gong, Y. Y., & Ensaff, H. (2025). The home food environment for children in Northern China. Appetite, 214, 108175. https://doi.org/10.1016/j.appet.2025.108175
Mariyam, F., Mehfuz, S., Asim, M., Ahuja, R., & Sadiq, Mohd. (2025). A rough-set based goal-oriented methodology for requirements analysis in information system design. Systems and Soft Computing, 7, 200410. https://doi.org/10.1016/j.sasc.2025.200410
Mukhsinah, E., Juniyanto, K., Amir, F., & Sundani, S. (2024). Strategi Mengatasi Tantangan Belajar Mahasiswa Universitas Pendidikan Indonesia Yang Aktif Dalam Bekerja. Jurnal Motivasi Pendidikan Dan Bahasa, 2(2), 25–36. https://doi.org/10.59581/jmpb-widyakarya.v2i2.3280
Pan, A. (2024). Enhanced SVM Algorithm-Based Dynamic Early Warning System for College English Ideological and Political Course Education Using Machine Learning. Journal of Cases on Information Technology, 26(1). https://doi.org/10.4018/JCIT.348657
Rafiq, U., Wang, X., & Guerra, E. (2025). Data analytics in software startups: Understanding key concepts and critical challenges. Information and Software Technology, 180, 107652. https://doi.org/10.1016/j.infsof.2024.107652
Rawat, A., Garg, C. P., & Sinha, P. (2025). Embracing strategies to overcome the electric vehicles adoption barriers in emerging market. Research in Transportation Business & Management, 60, 101337. https://doi.org/10.1016/j.rtbm.2025.101337
Rishabh, R., & Das, K. N. (2025). A fusion of decomposed fuzzy based decision-making and metaheuristic optimization system for sustainable planning of urban transport. Knowledge-Based Systems, 324, 113823. https://doi.org/10.1016/j.knosys.2025.113823
Sheikhkhoshkar, M., El-Haouzi, H. B., Aubry, A., Hamzeh, F., & Rahimian, F. (2025). A data-driven and knowledge-based decision support system for optimized construction planning and control. Automation in Construction, 173, 106066. https://doi.org/10.1016/j.autcon.2025.106066
Supangat, & Sulistyawan, M. R. (2023). Pemodelan Prediksi Tingkat Kelulusan Mahasiswa dengan Pendekatan Algoritma Naïve Bayes. Jurnal Informatika Polinema, 9(4). https://doi.org/10.33795/jip.v9i4.1367
Susila, I., Dean, D., Harismah, K., Priyono, K. D., Setyawan, A. A., & Maulana, H. (2024). Does interconnectivity matter? An integration model of agro-tourism development. Asia Pacific Management Review, 29(1), 104–114. https://doi.org/10.1016/j.apmrv.2023.08.003
Taleb, M. (2025). A general series network slacks-based measure approach for non-controllable inputs and undesirable outputs in mixed integer data envelopment analysis: An application to airport operations. Socio-Economic Planning Sciences, 99, 102211. https://doi.org/10.1016/j.seps.2025.102211
Villar, A., & de Andrade, C. R. V. (2024). Supervised machine learning algorithms for predicting student dropout and academic success: a comparative study. Discover Artificial Intelligence, 4(2). https://doi.org/10.1007/s44163-023-00079-z
Wang, L., & Zhou, H. (2025). A study on improving the teaching quality of college english classrooms based on fuzzy evaluation. Systems and Soft Computing, 7, 200407. https://doi.org/10.1016/j.sasc.2025.200407
Wijaya, R. (2020). Perbandingan Kinerja Algoritma Sorting pada Data Acak. Jurnal Informatika Nusantara, 9(1), 33–40.
Yücesan, E. (2025). Does deglobalization imply the end of global supply chains? International Business Review, 34(6), 102398. https://doi.org/10.1016/j.ibusrev.2025.102398
Authors
Copyright (c) 2025 Kholilah Husni, Elliya Sestri, Vany Terisia

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.