Optimizing URL-Based Phishing Detection Using XGBoost and Relief Feature Selection

Wahyu Suryaning Tyas; Fauzi Adi Rafrastara; Wildanil Ghozi

doi:10.33395/sinkron.v10i1.15651

ScienceGate Book Chapters

JOURNAL ARTICLE

Optimizing URL-Based Phishing Detection Using XGBoost and Relief Feature Selection

Wahyu Suryaning Tyas Fauzi Adi Rafrastara Wildanil Ghozi

Year: 2026 Journal: SinkrOn Vol: 10 (1)Pages: 430-438

DOI: 10.33395/sinkron.v10i1.15651

Get Full-Text PDF Get Analytical Report

Abstract

Phishing is a significant cybersecurity threat in which attackers exploit manipulated URLs to deceive users and obtain confidential information. As phishing attacks continue to grow in complexity, automated machine learning based detection methods have become essential to strengthen digital security. This study proposes a URL based phishing detection model using boosting algorithms while analyzing the role of feature selection in improving classification performance and computational efficiency. The experiments were conducted on a dataset consisting of 10000 instances with 50 features and balanced class labels. After data preparation, 48 features were retained as input variables, and min max normalization was applied to ensure uniform feature scaling. Three boosting algorithms namely Gradient Boosting, XGBoost, and AdaBoost were evaluated using accuracy, precision, recall, and F1 score. Among these methods, XGBoost achieved the highest accuracy of 98.8 percent, demonstrating its effectiveness in learning complex URL patterns. Subsequently, three feature selection techniques namely Information Gain, Chi Square, and ReliefF were applied and evaluated using 10 fold cross validation. The results indicate that ReliefF provides the most effective feature reduction by selecting 37 features while maintaining the same classification accuracy. Unlike previous studies that mainly focus on classifier comparison, this study demonstrates that integrating XGBoost with ReliefF enables significant feature dimensionality reduction without compromising predictive accuracy. This finding highlights an efficient trade off between detection performance and computational complexity. Overall, the proposed framework offers a robust, efficient, and scalable solution for fast and adaptive phishing detection in modern cybersecurity environments.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Misinformation and Its Impacts

Social Sciences → Social Sciences → Sociology and Political Science

Cybercrime and Law Enforcement Studies

Physical Sciences → Computer Science → Information Systems

Optimizing URL-Based Phishing Detection Using XGBoost and Relief Feature Selection

Abstract

Metrics

Topics

Related Documents

Phishing URL Detection Using XGBoost

Phishing URL Detection Using XGBoost and Custom Feature Engineering

Phishing URL detection-based feature selection to classifiers

Phishing URL detection-based feature selection to classifiers

A Filter-Based Feature Selection for Robust Phishing Attack Detection using XGBoost