JOURNAL ARTICLE

Reducing Web Spam using Content and Link Analysis

Dereck Shepherd Chikuni, Memory Manzungu

Year: 2017 Journal: Asian Accounting and Auditing Advancement Vol: 8 (1) Pages: 18-23

Abstract

Search engine manipulation techniques are multiplying rapidly, which makes the need for anti-web-spam filters evident. In this paper, we fuse content analysis metrics with link analysis algorithms to retrieve relevant documents while blocking spam pages. We compare the efficiency of the resulting algorithm with that of well-known link analysis algorithms. The implementation uses two levels of filtering and aims to keep both recall and precision high. The hybrid implementation outperforms the popular HITS algorithm.
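
The abstract does not specify which content metrics or link algorithm the authors combine, so the following is only a minimal sketch of the general idea of two-level filtering: a content check first, then link analysis on the pages that survive. The keyword-density metric, the `max_density` threshold, the function names, and the toy pages are all hypothetical and are not taken from the paper. HITS is used for the second level only because the abstract names it as the baseline.

```python
import math


def keyword_density(text, query_terms):
    """Hypothetical content metric: share of tokens that are query terms.

    A very high share is treated here as a sign of keyword stuffing.
    """
    tokens = text.lower().split()
    if not tokens:
        return 1.0  # treat empty pages as suspicious
    return sum(1 for t in tokens if t in query_terms) / len(tokens)


def hits(graph, iterations=50):
    """Plain HITS power iteration.

    graph maps each page to the list of pages it links to.
    Returns (hub_scores, authority_scores), each L2-normalised.
    """
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority of a page: sum of hub scores of pages linking to it.
        new_auth = {n: 0.0 for n in nodes}
        for u, targets in graph.items():
            for v in targets:
                new_auth[v] += hub[u]
        norm = math.sqrt(sum(a * a for a in new_auth.values())) or 1.0
        auth = {n: a / norm for n, a in new_auth.items()}
        # Hub score of a page: sum of authority scores of pages it links to.
        new_hub = {n: sum(auth[v] for v in graph.get(n, [])) for n in nodes}
        norm = math.sqrt(sum(h * h for h in new_hub.values())) or 1.0
        hub = {n: h / norm for n, h in new_hub.items()}
    return hub, auth


def filter_then_rank(pages, links, query_terms, max_density=0.3):
    """Level 1: drop pages failing the content check.
    Level 2: rank the remaining pages by HITS authority."""
    kept = {url for url, text in pages.items()
            if keyword_density(text, query_terms) <= max_density}
    # Links to or from removed pages are pruned before link analysis.
    graph = {u: [v for v in links.get(u, []) if v in kept] for u in kept}
    _, auth = hits(graph)
    return sorted(kept, key=lambda u: auth[u], reverse=True)


# Toy data (invented for illustration only).
pages = {
    "a": "guide to hiking trails and mountain maps",
    "b": "hiking hiking hiking cheap hiking boots hiking",
    "c": "trail reports from local hiking clubs",
    "d": "photos of alpine lakes",
}
links = {"a": ["c"], "b": ["a", "c", "d"], "c": ["a"], "d": ["c"]}

ranking = filter_then_rank(pages, links, {"hiking"})
print(ranking)
```

In this toy run, page "b" is removed at the first level because most of its tokens are the query term, so its outgoing links no longer contribute to any page's authority score at the second level.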

Keywords:
Computer science; Search engine; Information retrieval; Link analysis; Precision and recall; Spamdexing; Blocking (statistics); Data mining; World Wide Web; Web search query; Computer network; Web search engine

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.59

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics

Related Documents

JOURNAL ARTICLE

Spam web page detection using combined content and link features

Rajendra Kumar Roul, Shubham Rohan Asthana, Gaurav Kumar

Journal: International Journal of Data Mining Modelling and Management Year: 2016 Vol: 8 (3) Pages: 209-209

BOOK-CHAPTER

Combating Link Spam by Noisy Link Analysis

Yitong Wang, Xiaofei Chen, Xiaojun Feng

Lecture Notes in Computer Science Year: 2010 Pages: 453-464

JOURNAL ARTICLE

Spam Link Detection using Graph Mining

Akankasha Mishra, Sheetal Mehta

Journal: International Journal of Computer Applications Year: 2014 Vol: 91 (17) Pages: 11-14