Implementation of Data Mining Techniques for Lung Cancer Prediction Using the C4.5 Decision Tree Algorithm

Research

Authors

  • Angela Putri Rudhiyanto, Universitas Bina Sarana Informatika
  • Najla Damayanti, Universitas Bina Sarana Informatika
  • Esthi Ayu Anggita, Universitas Bina Sarana Informatika
  • Ananda Putri Maharani, Universitas Bina Sarana Informatika
  • Amanda Chickita Aprilia Juanda, Universitas Bina Sarana Informatika

DOI:

https://doi.org/10.31004/jerkin.v4i3.4976

Keywords:

Machine Learning, Linear Regression, Final Grade Prediction, Attendance, Assignment Score

Abstract

This study implements data mining techniques to predict lung cancer using the Decision Tree C4.5 algorithm. The dataset is the Lung Cancer Dataset, which consists of 309 records and 16 attributes covering patients' demographic factors, lifestyle habits, and clinical symptoms. The research was conducted in RapidMiner through four stages: data collection, data preprocessing, model construction, and model evaluation. The C4.5 algorithm builds a decision tree using the Gain Ratio criterion to identify the attributes that most influence lung cancer classification. The results show that the Decision Tree C4.5 algorithm achieved an accuracy of 96.76%, a classification error of 3.24%, a Kappa value of 0.844, a weighted mean recall of 89.86%, and a weighted mean precision of 94.93%. The generated decision tree indicates that the ALLERGY attribute is the most dominant factor in classifying lung cancer, followed by other attributes such as Yellow_Finger, Peer_Pressure, and Swallowing_Difficulty. These findings indicate that the Decision Tree C4.5 algorithm is effective and highly interpretable for lung cancer prediction and has strong potential as an early decision-support tool in medical diagnosis.
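The Gain Ratio criterion mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the study's RapidMiner process: the function names `entropy` and `gain_ratio` and the toy records below are hypothetical and are not taken from the Lung Cancer Dataset. The sketch assumes categorical attributes and uses the standard definition, information gain divided by split information.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of categorical values."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def gain_ratio(values, labels):
    """Gain ratio of one categorical attribute with respect to the class labels."""
    total = len(labels)
    # Group the class labels by the attribute value of each record.
    partitions = {}
    for value, label in zip(values, labels):
        partitions.setdefault(value, []).append(label)
    # Information gain: entropy before the split minus weighted entropy after it.
    remainder = sum(len(part) / total * entropy(part) for part in partitions.values())
    info_gain = entropy(labels) - remainder
    # Split information penalises attributes with many distinct values.
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical toy records for illustration only.
allergy = ["yes", "yes", "yes", "no", "no", "no"]
smoking = ["yes", "no", "yes", "no", "yes", "no"]
cancer = ["yes", "yes", "yes", "no", "no", "no"]

ratios = {
    "allergy": gain_ratio(allergy, cancer),
    "smoking": gain_ratio(smoking, cancer),
}
# The attribute with the highest gain ratio would be chosen for the split.
best_attribute = max(ratios, key=ratios.get)
print(ratios, best_attribute)
```

In this toy data the `allergy` column separates the classes perfectly, so it receives the higher gain ratio and would be placed at the root. C4.5 applies the same selection recursively to each branch.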

Published

30-12-2025

How to Cite

Rudhiyanto, A. P., Damayanti, N., Anggita, E. A., Maharani, A. P., & Juanda, A. C. A. (2025). Implementasi Teknik Data Mining untuk Prediksi Kanker Paru – Paru Menggunakan Algoritma Decision Tree C4.5 : Penelitian. Jurnal Pengabdian Masyarakat Dan Riset Pendidikan, 4(3), 16663–16669. https://doi.org/10.31004/jerkin.v4i3.4976