JOURNAL ARTICLE

Scalable malware clustering through coarse-grained behavior modeling

Abstract

Anti-malware vendors receive several thousand new malware (malicious software) variants per day. Due to large volume of malware samples, it has become extremely important to group them based on their malicious characteristics. Grouping of malware variants that exhibit similar behavior helps to generate malware signatures more efficiently. Unfortunately, exponential growth of new malware variants and huge-dimensional feature space, as used in existing approaches, make the clustering task very challenging and difficult to scale. Furthermore, malware behavior modeling techniques proposed in the literature do not scale well, where malware feature space grows in proportion with the number of samples under examination. In this paper, we propose a scalable malware behavior modeling technique that models the interactions between malware and sensitive system resources in a coarse-grained manner. Coarse-grained behavior modeling enables us to generate malware feature space that does not grow in proportion with the number of samples under examination. A preliminary study shows that our approach generates 289 times less malware features and yet improves the average clustering accuracy by 6.20% comparing to a state-of-the-art malware clustering technique.

Keywords:
Malware Computer science Cluster analysis Scalability Malware analysis Software Feature (linguistics) Data mining Scale (ratio) Machine learning Artificial intelligence Computer security Operating system

Metrics

20
Cited By
0.98
FWCI (Field Weighted Citation Impact)
3
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Malware Detection Techniques
Physical Sciences →  Computer Science →  Signal Processing
Network Security and Intrusion Detection
Physical Sciences →  Computer Science →  Computer Networks and Communications
Software Testing and Debugging Techniques
Physical Sciences →  Computer Science →  Software

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.