JOURNAL ARTICLE

Neural Malware Control with Deep Reinforcement Learning

Abstract

Antimalware products are a key component in detecting malware attacks, and their engines typically execute unknown programs in a sandbox prior to running them on the native operating system. Files cannot be scanned indefinitely so the engine employs heuristics to determine when to halt execution. Previous research has investigated analyzing the sequence of system calls generated during this emulation process to predict if an unknown file is malicious, but these models often require the emulation to be stopped after executing a fixed number of events from the beginning of the file. Also, these classifiers are not accurate enough to halt emulation in the middle of the file on their own. In this paper, we propose a novel algorithm which overcomes this limitation and learns the best time to halt the file's execution based on deep reinforcement learning (DRL). Because the new DRL-based system continues to emulate the unknown file until it can make a confident decision to stop, it prevents attackers from avoiding detection by initiating malicious activity after a fixed number of system calls. Results show that the proposed malware execution control model automatically halts emulation for 91.3% of the files earlier than heuristics employed by the engine. Furthermore, classifying the files at that time significantly improves the classifier's accuracy. This new model improves the true positive rate by 61.5%, at a false positive rate of 1%, compared to the best baseline classifier.

Keywords:
Emulation Computer science Malware Reinforcement learning System call Heuristics Artificial intelligence Learning classifier system Classifier (UML) Machine learning Artificial neural network Operating system

Metrics

16
Cited By
1.64
FWCI (Field Weighted Citation Impact)
49
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Malware Detection Techniques
Physical Sciences →  Computer Science →  Signal Processing
Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Network Security and Intrusion Detection
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Actor Critic Deep Reinforcement Learning for Neural Malware Control

Yu WangJack W. StokesMady Marinescu

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2020 Vol: 34 (01)Pages: 1005-1012
JOURNAL ARTICLE

EvadeRL: Evading PDF Malware Classifiers with Deep Reinforcement Learning

Zhengyang MaoZhiyang FangMeijin LiYang Fan

Journal:   Security and Communication Networks Year: 2022 Vol: 2022 Pages: 1-14
JOURNAL ARTICLE

Reinforcement Learning with Deep Quantum Neural Networks

Wei HuJames Lee Hu

Journal:   Journal of Quantum Information Science Year: 2019 Vol: 09 (01)Pages: 1-14
© 2026 ScienceGate Book Chapters — All rights reserved.