JOURNAL ARTICLE

Comparative study of DistilBERT and ELECTRA-Small Models in Spam Email Classification

Ferdy Agusman

Year: 2025 Journal:   Jurnal Informatika Vol: 12 (2)Pages: 113-121   Publisher: LPPM BSI Bandung

Abstract

Spam email detection is one of the challenging tasks in cybersecurity due to the variability of spam content. These characteristics make it harder to identify spam, therefore researchers create different spam detection methods. Among these, Natural Language Processing (NLP) and machine learning techniques have shown outstanding results in classifying emails as spam or non-spam. Transformer-based models, such as BERT, have demonstrated pinpoint accuracy in text classification tasks. However, the computational requirements and resources are not practical in resource-limited environments. In order to mitigate this, smaller and more lightweight models, such as the DistilBERT and ELECTRA-Small, have been developed. Both models are renowned for their efficiency and accuracy. This study focuses on the comparison of these models in terms of accuracy, precision, recall, and F1 score. Experimental results revealed that while both models excel in binary classification, notable differences emerge. ELECTRA-small shows exceptional accuracy, precision and faster processing time, while DistilBERT demonstrates superior recall, highlighting its effectiveness in minimizing false negatives.

Keywords:
Computer science

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.42
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Misinformation and Its Impacts
Social Sciences →  Social Sciences →  Sociology and Political Science
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.