JOURNAL ARTICLE

Penerapan Data Mining dalam Analisis Prediksi Kanker Paru Menggunakan Algoritma Random Forest

Abstract

Cancer is the second highest cause of death in the world. In Indonesia, it is a disease with a high mortality rate. Most patients do not realize that they have lung cancer thus the treatment is sometimes too late. A prediction method with a high degree of accuracy is needed to detect lung cancer earlier. Previous research used data mining calcification methods with the Naïve Bayes algorithm to predict lung cancer. This research resulted in high recall values for the positive class (Yes class) but low for the negative class (No class). This research was made using the Random Forest algorithm which is known to have good performance. The modeling is optimized by applying the K-fold Cross Validation technique. The Random Forest algorithm produces a higher Accuracy value than the Naïve Bayes algorithm, which is 98.4%. This algorithm produces 100% Recall for the positive class, 80% for the negative class and provides a 100% correct prediction as can be seen from the AUC value of 1. Although a statistical test with a significance level of 5% shows the results of the two algorithms are not significantly different.

Keywords:
Random forest Naive Bayes classifier Class (philosophy) Recall Bayes' theorem Statistics Lung cancer Mathematics Artificial intelligence Computer science Algorithm Pattern recognition (psychology) Medicine Internal medicine Psychology Bayesian probability

Metrics

11
Cited By
6.80
FWCI (Field Weighted Citation Impact)
16
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining and Machine Learning Applications
Physical Sciences →  Computer Science →  Information Systems
Edcuational Technology Systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimedia Learning Systems
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Analisis Data Mining Untuk Prediksi Kanker Payudara Menggunakan Algoritma Klasifikasi

Raden Tio Putra SudewoYovi PratamaElvi Yanti

Journal:   Jurnal Pustaka Data (Pusat Akses Kajian Database Analisa Teknologi dan Arsitektur Komputer) Year: 2023 Vol: 3 (2)Pages: 62-69
JOURNAL ARTICLE

Prediksi Kekambuhan Kanker Tiroid Menggunakan Algoritma Random Forest

Egi SafitriDani RofiantoSri KarnilaNurjoko NurjokoHendra KurniawanYuni ArkhiansyahRuki Rizal

Journal:   Jurnal SISKOM-KB (Sistem Komputer dan Kecerdasan Buatan) Year: 2025 Vol: 8 (3)Pages: 178-184
© 2026 ScienceGate Book Chapters — All rights reserved.