Prediksi Kelulusan Siswa Berdasarkan Data Demografis dan Akademik pada Dataset Student Performance

Penelitian

Authors

  • Ramadhani Zidan Arifin Universitas Pancasakti Tegal
  • Hasbi Firmansyah Universitas Pancasakti Tegal
  • Wahyu Asriyani Universitas Pancasakti Tegal

DOI:

https://doi.org/10.31004/jerkin.v4i2.4251

Keywords:

Logistic Regression, Graduation Prediction, Demographic Variables, Academic Factors, Educational Data Mining.

Abstract

This study aims to predict student graduation outcomes by utilizing demographic and academic variables from the Student Performance Dataset. The analysis was conducted using the Logistic regression method, selected for its ability to handle binary outcomes and provide clear interpretability of predictor contributions. The research process included data preprocessing, removal of variables G1 and G2 to prevent data leakage, and conversion of the final grade (G3) into a binary graduation label. The model was evaluated using accuracy, logistic loss, and a confusion matrix to measure predictive reliability and classification stability. The results indicate that the model achieved an accuracy of 78.85% with a logistic loss value of 0.412, demonstrating stable performance and good generalizability. These findings suggest that simple demographic and academic attributes—such as age, study time, prior failures, and attendance—play a significant role in predicting graduation likelihood. Overall, the study confirms that Logistic regression is an effective approach for educational data analysis and can be utilized by schools to identify at-risk students and design more targeted instructional interventions.

References

Anonymous. (2023). Prediksi Kategori Kelulusan Mahasiswa Menggunakan Metode Regresi Logistik Multinomial. ResearchGate Preprint. https://www.researchgate.net/publication/371084473_Prediksi_Kategori_Kelulusan_Mahasiswa_Menggunakan_Metode_Regresi_Logistik_Multinomial

Authors, U. P. (2022). Regresi Logistik Biner untuk Mengklasifikasikan Cara Belajar Mahasiswa. SciLine Journal (Unup Purwokerto). https://journal.unupurwokerto.ac.id/index.php/sciline/article/download/182/209/

Cortez, P., & Silva, A. (2008). Using Data Mining to Predict Secondary School Student Performance. Proceedings of the International Conference on Educational Data Mining / Related Workshop.

Elvida, N. (2024). Penerapan Data Mining untuk Prediksi Kelulusan Siswa. BCE Attractive Journal. https://attractivejournal.com/index.php/bce/article/view/1538

Febrinita, F. (2024). Faktor-Faktor yang Mempengaruhi Hasil Belajar Statistika: Analisis Empiris. Kognitif Journal. https://etdci.org/journal/kognitif/article/view/1588

Gunawan, P. H. (2025). Deteksi Tingkat Potensi Kelulusan Calon Mahasiswa Menggunakan Data Akademik Sekolah. STMSI (Unisi) Journal. https://sistemasi.ftik.unisi.ac.id/index.php/stmsi/article/download/5331/1032

JKTI, U. (2025). Predicting Student Graduation Using Logistic regression and Adam Optimization. Jurnal Teknologi Dan Informatika (JKTI). https://jurnal.unimus.ac.id/index.php/JKTI/article/viewFile/16189/pdf

Journal, U. K. (2020). Prediksi Kelulusan Tepat Waktu Berdasarkan Riwayat Akademik (Naive Bayes Study). Jurnal Decode (Universitas Halu Oleo / UM Kendari). https://journal.umkendari.ac.id/decode/article/download/308/144/1882

Junaidi, S. (2023). Prediksi Kelulusan Tepat Waktu Mahasiswa. EDikInformatika (Ejournal.Upgrisba.Ac.Id). https://ejournal.upgrisba.ac.id/index.php/eDikInformatika/article/view/7324

Kharis, S. A. A., & Zili, A. H. A. (2022). Learning Analytics dan Educational Data Mining pada Data Pendidikan. Jurnal Universitas Terbuka / ResearchGate Preprint. https://www.researchgate.net/publication/359631908_Learning_Analytics_dan_Educational_Data_Mining_pada_Data_Pendidikan

Latupeirissa, S. J. (2019). Pemodelan Lama Masa Studi Mahasiswa Menggunakan Regresi Logistik Ordinal. Jurnal FMIPA (Garuda / Repository Nasional). https://download.garuda.kemdikbud.go.id/article.php?article=1442829&title=PEMODELAN+LAMA+MASA+STUDI+MAHASISWA+FMIPA+UNPATTI+MENGGUNAKAN+REGRESI+LOGISTIK+ORDINAL+DENGAN+EFEK+INTERAKSI&val=17683

Maqfiroh, & Mujiyono, S. (2022). Penerapan Klasifikasi Algoritma Data Mining C4.5 untuk Memprediksi Tingkat Kelulusan Siswa. Attractive Journal. https://attractivejournal.com/index.php/bce/article/view/1538

Mk, C. R. P. (2025). Data Mining Classification Model For Timeliness Of Student Graduation. DE Journal (Undhari). https://ejournal.undhari.ac.id/index.php/de_journal/article/view/1349

STIS, P. (2019). Penerapan Metode Regresi Logistik Biner untuk Mengetahui Faktor Pengaruh. Prosiding Seminar Nasional Statistik (STIS). https://prosiding.stis.ac.id/index.php/semnasoffstat/article/download/146/43/

Triyasri, N. (2021). Prediction of Academic Success Using Logistic regression. Jurnal JUSTIN, Universitas Tanjungpura. https://jurnal.untan.ac.id/index.php/justin/article/download/89731/75676607260

Downloads

Published

14-12-2025

How to Cite

Ramadhani Zidan Arifin, Hasbi Firmansyah, & Wahyu Asriyani. (2025). Prediksi Kelulusan Siswa Berdasarkan Data Demografis dan Akademik pada Dataset Student Performance: Penelitian. Jurnal Pengabdian Masyarakat Dan Riset Pendidikan, 4(2), 13300–13307. https://doi.org/10.31004/jerkin.v4i2.4251

Most read articles by the same author(s)