JOURNAL ARTICLE

Research on Naive Bayesian Spam SMS Filtering Based on MapReduce

彩迪 赵

Year: 2016 Journal:   Computer Science and Application Vol: 06 (07)Pages: 443-450   Publisher: Hans Publishers

Abstract

针对海量短信文本的挖掘过滤需要很大的存储空间以及更强的计算能力,提出一种基于MapReduce的朴素贝叶斯的垃圾短信过滤方法;基于改进的朴素贝叶斯垃圾短信分类算法,利用MapReduce模型并行化对海量数据处理的优势进行短信文本的训练和测试。实验表明:利用计算集群实现海量垃圾短信过滤在召回率、查准率方面有所提高,垃圾短信过滤效率随着集群规模的扩增而提升较快。 The massive text mining filter requires a lot of storage space and stronger computing ability, so a spam message filtering method of MapReduce-based Bayesian is proposed. Based on the improved Naive Bayesian spam SMS classification algorithm, taking the advantage of MapReduce model pa-rallelization on massive data processing is used to train and test SMS text. Results show that using compute cluster to achieve massive spam filtering can improve the efficiency of recalling and pre-cision, and with the expansion of cluster size spam SMS filtering efficiency improve faster.

Keywords:
Computer science Naive Bayes classifier Spamming Data mining Machine learning Information retrieval Artificial intelligence World Wide Web The Internet Support vector machine

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
4
Refs
0.12
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.