JOURNAL ARTICLE

Classification of Textual E‐Mail Spam Using Data Mining Techniques

Rasim M. АlgulievRamiz M. AliguliyevSaadat Nazirova

Year: 2011 Journal:   Applied Computational Intelligence and Soft Computing Vol: 2011 (1)   Publisher: Hindawi Publishing Corporation

Abstract

A new method for clustering of spam messages collected in bases of antispam system is offered. The genetic algorithm is developed for solving clustering problems. The objective function is a maximization of similarity between messages in clusters, which is defined by k ‐nearest neighbor algorithm. Application of genetic algorithm for solving constrained problems faces the problem of constant support of chromosomes which reduces convergence process. Therefore, for acceleration of convergence of genetic algorithm, a penalty function that prevents occurrence of infeasible chromosomes at ranging of values of function of fitness is used. After classification, knowledge extraction is applied in order to get information about classes. Multidocument summarization method is used to get the information portrait of each cluster of spam messages. Classifying and parametrizing spam templates, it will be also possible to define the thematic dependence from geographical dependence (e.g., what subjects prevail in spam messages sent from certain countries). Thus, the offered system will be capable to reveal purposeful information attacks if those occur. Analyzing origins of the spam messages from collection, it is possible to define and solve the organized social networks of spammers.

Keywords:
Automatic summarization Computer science Cluster analysis Data mining Convergence (economics) Genetic algorithm Fitness function Process (computing) Function (biology) Similarity (geometry) Information retrieval Artificial intelligence Machine learning Image (mathematics)

Metrics

38
Cited By
7.47
FWCI (Field Weighted Citation Impact)
32
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Spam Mail Filtering Using Data Mining Approach

Ajay Gupta

Advances in data mining and database management book series Year: 2020 Pages: 253-282
JOURNAL ARTICLE

Detecting Forged E-Mail using Data Mining Techniques

Prasanta Kumar SahooCheguri Rajitha

Journal:   International Journal of Engineering and Advanced Technology Year: 2019 Vol: 9 (1)Pages: 4017-4023
JOURNAL ARTICLE

Performance Evaluation of Data Mining based Classifier for Classification of Spam E-Mail

Manish Kumar Sahu

Journal:   International Journal for Research in Applied Science and Engineering Technology Year: 2017 Vol: V (IV)Pages: 1193-1196
BOOK-CHAPTER

Machine Learning Model for the E-mail Spam Detection with Data Mining Techniques

Sakshi KinhaGanesh GuptaKavita Kavita

Lecture notes in networks and systems Year: 2023 Pages: 511-519
© 2026 ScienceGate Book Chapters — All rights reserved.