JOURNAL ARTICLE

Optimizing URL-Based Phishing Detection Using XGBoost and Relief Feature Selection

Abstract

Phishing is a significant cybersecurity threat in which attackers use manipulated URLs to deceive users and obtain confidential information. As phishing attacks grow in complexity, automated detection based on machine learning has become essential to digital security. This study proposes a URL-based phishing detection model built on boosting algorithms and analyzes the role of feature selection in improving classification performance and computational efficiency. The experiments were conducted on a dataset of 10,000 instances with 50 features and balanced class labels. After data preparation, 48 features were retained as input variables, and min-max normalization was applied to ensure uniform feature scaling. Three boosting algorithms, namely Gradient Boosting, XGBoost, and AdaBoost, were evaluated using accuracy, precision, recall, and F1 score. Among these methods, XGBoost achieved the highest accuracy, 98.8 percent, demonstrating its effectiveness in learning complex URL patterns. Subsequently, three feature selection techniques, namely Information Gain, Chi-Square, and ReliefF, were applied and evaluated using 10-fold cross-validation. The results indicate that ReliefF provides the most effective feature reduction, selecting 37 of the 48 features while maintaining the same classification accuracy. Unlike previous studies that mainly focus on classifier comparison, this study demonstrates that integrating XGBoost with ReliefF reduces feature dimensionality substantially without compromising predictive accuracy. This finding highlights an efficient trade-off between detection performance and computational complexity. Overall, the proposed framework offers an efficient and scalable approach to phishing detection in modern cybersecurity environments.
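
The abstract names min-max normalization and ReliefF feature selection but gives no implementation details. The following is a minimal, dependency-free sketch of those two steps, not the authors' code. The function names (`min_max_normalize`, `relieff_scores`), the Manhattan distance, the neighbour count `k=3`, and the tiny synthetic dataset are all assumptions made for illustration; the study's actual dataset and parameter settings are not stated in the abstract.

```python
import random


def min_max_normalize(rows):
    """Scale each column of `rows` to [0, 1]; constant columns map to 0.0."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [max(c) - min(c) for c in cols]
    return [
        [(v - lo) / sp if sp else 0.0 for v, lo, sp in zip(row, lows, spans)]
        for row in rows
    ]


def relieff_scores(X, y, k=3):
    """ReliefF-style feature weights for a two-class problem.

    Assumes features are already scaled to [0, 1]. For each instance, a
    feature's weight goes down by its mean distance to the k nearest
    same-class neighbours (hits) and up by its mean distance to the k
    nearest other-class neighbours (misses).
    """
    n, d = len(X), len(X[0])
    weights = [0.0] * d
    for i in range(n):
        # Manhattan distance from instance i to every other instance.
        dists = sorted(
            (sum(abs(a - b) for a, b in zip(X[i], X[j])), j)
            for j in range(n)
            if j != i
        )
        hits = [j for _, j in dists if y[j] == y[i]][:k]
        misses = [j for _, j in dists if y[j] != y[i]][:k]
        if not hits or not misses:
            continue  # a class with a single member has no hits
        for f in range(d):
            hit_diff = sum(abs(X[i][f] - X[j][f]) for j in hits) / len(hits)
            miss_diff = sum(abs(X[i][f] - X[j][f]) for j in misses) / len(misses)
            weights[f] += (miss_diff - hit_diff) / n
    return weights


# Synthetic stand-in data (NOT the paper's dataset): feature 0 separates the
# classes, features 1 and 2 are noise on very different raw scales.
rng = random.Random(0)
y = [i % 2 for i in range(40)]
raw = [
    [
        rng.uniform(80, 100) if label else rng.uniform(0, 20),
        rng.uniform(0, 1),
        rng.uniform(0, 1000),
    ]
    for label in y
]

X = min_max_normalize(raw)
scores = relieff_scores(X, y, k=3)
# Feature indices ordered from most to least relevant.
ranking = sorted(range(len(scores)), key=lambda f: scores[f], reverse=True)
print("ranking:", ranking)
print("scores:", [round(s, 3) for s in scores])
```

In a workflow like the one the abstract describes, the top-ranked features (37 of 48 in the study) would then be passed to the classifier and compared against the full feature set under cross-validation. Normalizing first matters here because ReliefF is distance-based, so an unscaled feature with a large raw range would otherwise dominate the neighbour search.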

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Misinformation and Its Impacts
Social Sciences →  Social Sciences →  Sociology and Political Science
Cybercrime and Law Enforcement Studies
Physical Sciences →  Computer Science →  Information Systems
