JOURNAL ARTICLE

Improving Machine Learning Models for Malware Detection Using Embedded Feature Selection Method

Mohammed ChemmakhaOmar HabibiMohamed Lazaar

Year: 2022 Journal:   IFAC-PapersOnLine Vol: 55 (12)Pages: 771-776   Publisher: Elsevier BV

Abstract

Machine learning performance always rely on relevant phase of pre-processing, that includes dataset cleaning, cleansing and extraction. Feature selection (FS) is a crucial phase too, because it is intended to increase the efficiency of Machine Learning (ML) models in terms of predictiveness, by assigning a representative value to the most important features in a dataset of malware. In this study, we focus on feature selection using embedded-based methods in order to minimize computational time and complexity of ML models. Embedded-based methods combine advantages of both filter-based and wrapped-based methods, in terms of studying the importance of features while executing the model and their reduced time of execution. Applying ML models shows a high stability of models will selecting 10 most relevant features from the dataset, with an accuracy that achieve 99.47%, 99.02% for respectively Random Forest (RF) and XGBoost (XGB).

Keywords:
Computer science Feature selection Random forest Machine learning Malware Artificial intelligence Focus (optics) Stability (learning theory) Selection (genetic algorithm) Feature (linguistics) Filter (signal processing) Feature extraction Data mining Computer vision

Metrics

28
Cited By
5.26
FWCI (Field Weighted Citation Impact)
14
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Malware Detection Techniques
Physical Sciences →  Computer Science →  Signal Processing
Network Security and Intrusion Detection
Physical Sciences →  Computer Science →  Computer Networks and Communications
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.