Prediksi Kelulusan Siswa Berdasarkan Data Demografis dan Akademik pada Dataset Student Performance
Penelitian
DOI:
https://doi.org/10.31004/jerkin.v4i2.4251Keywords:
Logistic Regression, Graduation Prediction, Demographic Variables, Academic Factors, Educational Data Mining.Abstract
This study aims to predict student graduation outcomes by utilizing demographic and academic variables from the Student Performance Dataset. The analysis was conducted using the Logistic regression method, selected for its ability to handle binary outcomes and provide clear interpretability of predictor contributions. The research process included data preprocessing, removal of variables G1 and G2 to prevent data leakage, and conversion of the final grade (G3) into a binary graduation label. The model was evaluated using accuracy, logistic loss, and a confusion matrix to measure predictive reliability and classification stability. The results indicate that the model achieved an accuracy of 78.85% with a logistic loss value of 0.412, demonstrating stable performance and good generalizability. These findings suggest that simple demographic and academic attributes—such as age, study time, prior failures, and attendance—play a significant role in predicting graduation likelihood. Overall, the study confirms that Logistic regression is an effective approach for educational data analysis and can be utilized by schools to identify at-risk students and design more targeted instructional interventions.
References
Anonymous. (2023). Prediksi Kategori Kelulusan Mahasiswa Menggunakan Metode Regresi Logistik Multinomial. ResearchGate Preprint. https://www.researchgate.net/publication/371084473_Prediksi_Kategori_Kelulusan_Mahasiswa_Menggunakan_Metode_Regresi_Logistik_Multinomial
Authors, U. P. (2022). Regresi Logistik Biner untuk Mengklasifikasikan Cara Belajar Mahasiswa. SciLine Journal (Unup Purwokerto). https://journal.unupurwokerto.ac.id/index.php/sciline/article/download/182/209/
Cortez, P., & Silva, A. (2008). Using Data Mining to Predict Secondary School Student Performance. Proceedings of the International Conference on Educational Data Mining / Related Workshop.
Elvida, N. (2024). Penerapan Data Mining untuk Prediksi Kelulusan Siswa. BCE Attractive Journal. https://attractivejournal.com/index.php/bce/article/view/1538
Febrinita, F. (2024). Faktor-Faktor yang Mempengaruhi Hasil Belajar Statistika: Analisis Empiris. Kognitif Journal. https://etdci.org/journal/kognitif/article/view/1588
Gunawan, P. H. (2025). Deteksi Tingkat Potensi Kelulusan Calon Mahasiswa Menggunakan Data Akademik Sekolah. STMSI (Unisi) Journal. https://sistemasi.ftik.unisi.ac.id/index.php/stmsi/article/download/5331/1032
JKTI, U. (2025). Predicting Student Graduation Using Logistic regression and Adam Optimization. Jurnal Teknologi Dan Informatika (JKTI). https://jurnal.unimus.ac.id/index.php/JKTI/article/viewFile/16189/pdf
Journal, U. K. (2020). Prediksi Kelulusan Tepat Waktu Berdasarkan Riwayat Akademik (Naive Bayes Study). Jurnal Decode (Universitas Halu Oleo / UM Kendari). https://journal.umkendari.ac.id/decode/article/download/308/144/1882
Junaidi, S. (2023). Prediksi Kelulusan Tepat Waktu Mahasiswa. EDikInformatika (Ejournal.Upgrisba.Ac.Id). https://ejournal.upgrisba.ac.id/index.php/eDikInformatika/article/view/7324
Kharis, S. A. A., & Zili, A. H. A. (2022). Learning Analytics dan Educational Data Mining pada Data Pendidikan. Jurnal Universitas Terbuka / ResearchGate Preprint. https://www.researchgate.net/publication/359631908_Learning_Analytics_dan_Educational_Data_Mining_pada_Data_Pendidikan
Latupeirissa, S. J. (2019). Pemodelan Lama Masa Studi Mahasiswa Menggunakan Regresi Logistik Ordinal. Jurnal FMIPA (Garuda / Repository Nasional). https://download.garuda.kemdikbud.go.id/article.php?article=1442829&title=PEMODELAN+LAMA+MASA+STUDI+MAHASISWA+FMIPA+UNPATTI+MENGGUNAKAN+REGRESI+LOGISTIK+ORDINAL+DENGAN+EFEK+INTERAKSI&val=17683
Maqfiroh, & Mujiyono, S. (2022). Penerapan Klasifikasi Algoritma Data Mining C4.5 untuk Memprediksi Tingkat Kelulusan Siswa. Attractive Journal. https://attractivejournal.com/index.php/bce/article/view/1538
Mk, C. R. P. (2025). Data Mining Classification Model For Timeliness Of Student Graduation. DE Journal (Undhari). https://ejournal.undhari.ac.id/index.php/de_journal/article/view/1349
STIS, P. (2019). Penerapan Metode Regresi Logistik Biner untuk Mengetahui Faktor Pengaruh. Prosiding Seminar Nasional Statistik (STIS). https://prosiding.stis.ac.id/index.php/semnasoffstat/article/download/146/43/
Triyasri, N. (2021). Prediction of Academic Success Using Logistic regression. Jurnal JUSTIN, Universitas Tanjungpura. https://jurnal.untan.ac.id/index.php/justin/article/download/89731/75676607260
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Ramadhani Zidan Arifin, Hasbi Firmansyah, Wahyu Asriyani

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.












