JOURNAL ARTICLE

Indonesian language email spam detection using N-gram and Naïve Bayes algorithm

Yustinus VernandaSeng HansunMarcel Bonar Kristanda

Year: 2020 Journal:   Bulletin of Electrical Engineering and Informatics Vol: 9 (5)Pages: 2012-2019   Publisher: Institute of Advanced Engineering and Science (IAES)

Abstract

Indonesia is ranked the top 8th out of the total country population in the world for the global spammers. Web-based spam filter service with the REST API type can be used to detect email spam in the Indonesian language on the email server or various types of email server applications. With REST API, then there will be data exchange between the applications with JSON data type using existing HTTP commands. One type of spam filter commonly used is Bayesian Filtering, where the Naïve Bayes algorithm is used as a classification algorithm. Meanwhile, the N-gram method is used to increase the accuracy of the implementation of the Naïve Bayes algorithm in this study. N-gram and Naïve Bayes algorithms to detect spam email in the Indonesian language have successfully been implemented with accuracy around 0.615 until 0.94, precision at 0.566 until 0.924, recall at 0.96 until 1.00, and F-measure at 0.721 until 0.942. The best solution is found by using the 5-gram method with the highest score of accuracy at 0.94, precision at 0.924, recall at 0.96, and F-measure value at 0.942.

Keywords:
Naive Bayes classifier Algorithm JSON n-gram Computer science Precision and recall Population Machine learning Data mining Artificial intelligence Database Language model Medicine Support vector machine

Metrics

14
Cited By
1.03
FWCI (Field Weighted Citation Impact)
23
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Edcuational Technology Systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Mining and Machine Learning Applications
Physical Sciences →  Computer Science →  Information Systems
Multimedia Learning Systems
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Email Spam Detection using Naïve Bayes Algorithm

G. RevathiK. N. Brahmaji RaoG. Sita Ratnam

Journal:   International Journal for Research in Applied Science and Engineering Technology Year: 2022 Vol: 10 (9)Pages: 653-655
JOURNAL ARTICLE

Spam Email Detection using Naïve Bayes classifier

L. G. Wang

Journal:   ITM Web of Conferences Year: 2025 Vol: 70 Pages: 04028-04028
JOURNAL ARTICLE

Probability-based Naïve Bayes Algorithm for Email Spam Classification

A. SumithraA. AshifaS. HariniN. Kumaresan

Journal:   2022 International Conference on Computer Communication and Informatics (ICCCI) Year: 2022
© 2026 ScienceGate Book Chapters — All rights reserved.