Klasifikasi Hoax Menggunakan Metode TF-IDF + SVM

Penelitian

Authors

  • Avrillistianto Ananda Nabil Universitas Bina Sarana Informatika
  • Farih Ramdan Wildantama Universitas Bina Sarana Informatika
  • Dimas Satrianto Universitas Bina Sarana Informatika
  • Michael Gilbert Bakara Universitas Bina Sarana Informatika
  • Imam Budiawan Universitas Bina Sarana Informatika
  • Desi Mulyati Universitas Bina Sarana Informatika

DOI:

https://doi.org/10.31004/jerkin.v4i3.5078

Keywords:

Hoax News, TF-IDF, Support Vector Machine, Text Mining, Hoax Detection

Abstract

The spread of hoax news on social media causes social unrest and economic losses. This study builds a classification model for Indonesian hoax news using Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine (SVM). The dataset consists of 970 news from TurnBackHoax.id with FALSE and FRAUD categories. The research includes text preprocessing, TF-IDF feature extraction with unigram and bigram, and linear kernel SVM classification. Data was split 80:20 using stratified sampling with parameter optimization through Grid Search and 5-fold Cross Validation. Evaluation results show the model classifies hoax news with good performance based on accuracy, precision, recall, and f1-score metrics. The confusion matrix indicates most data was correctly classified despite errors in news with overlapping linguistic patterns. The study proves TF-IDF and SVM combination is effective for Indonesian hoax detection with low computational requirements. Further development is recommended using larger datasets and comparing with deep learning methods.

References

Aulia, R., & Kurniawati, D. (2022). Klasifikasi dokumen berita olahraga menggunakan metode TF-IDF dan Support Vector Machine. Jurnal Informatika, 9(2), 120–128.

Basuki, A., & Hendradi, T. (2022). Penyebaran hoaks di media sosial dan dampaknya terhadap masyarakat digital. Jurnal Komunikasi, 14(1), 45–56.

Desriana, R. (2024). Analisis fenomena hoaks di media sosial pada era digital. Jurnal Ilmu Komunikasi, 16(1), 33–42.

Desriansyah, R., & Utna Sari, P. (2024). Deteksi berita hoaks menggunakan pendekatan machine learning pada media daring. Jurnal Teknologi Informasi, 11(2), 101–110.

Dewi, S., Putra, A., & Lestari, N. (2025). Klasifikasi berita hoaks menggunakan metode TF-IDF dan Support Vector Machine. Jurnal Sistem Informasi, 13(1), 55–65.

Diki, R., & Betha, L. (2022). Karakteristik berita digital dan tantangan verifikasi informasi di era media sosial. Jurnal Jurnalisme, 7(2), 89–98.

Indra, A., Pratama, R., & Wulandari, S. (2025). Perbandingan metode SVM dan KNN dalam deteksi hoaks pada Twitter berbahasa Indonesia. Jurnal Data Science, 4(1), 15–25.

Mahendra, I., Nugroho, B., & Santoso, H. (2023). Pembobotan kata TF-IDF pada klasifikasi dokumen teks bahasa Indonesia. Jurnal Informatika dan Komputer, 10(3), 201–210.

Muhabatin, A., Sari, D., & Prakoso, E. (2021). Literasi digital dan penyebaran hoaks di media sosial. Jurnal Pendidikan Teknologi Informasi, 5(2), 67–75.

Pratiwi, L., Ramadhan, A., & Fauzi, M. (2021). Implementasi Support Vector Machine pada klasifikasi teks berita. Jurnal Teknologi Informasi dan Ilmu Komputer, 8(4), 301–308.

Ramadhan, F., Hidayat, R., & Maulana, A. (2022). Analisis berita hoaks di media sosial menggunakan pendekatan text mining. Jurnal Informatika, 9(1), 1–10.

Septiani, D., & Isabela, M. (2022). Analisis TF-IDF pada klasifikasi teks berita online. Jurnal Ilmu Komputer, 6(2), 75–83.

Downloads

Published

16-01-2025

How to Cite

Nabil, A. A., Wildantama, F. R., Satrianto, D., Bakara, M. G., Budiawan, I., & Mulyati, D. (2025). Klasifikasi Hoax Menggunakan Metode TF-IDF + SVM: Penelitian. Jurnal Pengabdian Masyarakat Dan Riset Pendidikan, 4(3), 17653–17660. https://doi.org/10.31004/jerkin.v4i3.5078

Most read articles by the same author(s)