A Novel Self-Supervised Framework Based on Masked Autoencoder for Traffic Classification

Ruijie Zhao; Mingwei Zhan; Xianwen Deng; Fangqi Li; Yanhao Wang; Yijun Wang; Guan Gui; Zhi Xue

doi:10.1109/tnet.2023.3335253

ScienceGate Book Chapters

JOURNAL ARTICLE

A Novel Self-Supervised Framework Based on Masked Autoencoder for Traffic Classification

Ruijie Zhao Mingwei Zhan Xianwen Deng Fangqi Li Yanhao Wang Yijun Wang Guan Gui Zhi Xue

Year: 2024 Journal: IEEE/ACM Transactions on Networking Vol: 32 (3)Pages: 2012-2025 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tnet.2023.3335253

Get Full-Text PDF Get Analytical Report

Abstract

Traffic classification is a critical task in network security and management. Recent research has demonstrated the effectiveness of the deep learning-based traffic classification method. However, the following limitations remain: (1) the traffic representation is simply generated from raw packet bytes, resulting in the absence of important information; (2) the model structure of directly applying deep learning algorithms does not take traffic characteristics into account; and (3) scenario-specific classifier training usually requires a labor-intensive and time-consuming process to label data. In this paper, we introduce a masked autoencoder (MAE) based traffic transformer with multi-level flow representation to tackle these problems. To model raw traffic data, we design a formatted traffic representation matrix with hierarchical flow information. After that, we develop an efficient Traffic Transformer, in which packet-level and flow-level attention mechanisms implement more efficient feature extraction with lower complexity. At last, we utilize MAE paradigm to pre-train our classifier with a large amount of unlabeled data, and perform fine-tuning with a few labeled data for a series of traffic classification tasks. Experiment findings reveal that our method outperforms state-of-the-art methods on five real-world traffic datasets by a large margin. The code is available at https://github.com/NSSL-SJTU/YaTC.

Keywords:

Computer science Autoencoder Traffic classification Artificial intelligence Machine learning Byte Data mining Feature learning Classifier (UML) Feature extraction Network packet Deep learning Computer network

Metrics

Cited By

12.78

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Internet Traffic Analysis and Secure E-voting

Physical Sciences → Computer Science → Artificial Intelligence

Network Security and Intrusion Detection

Physical Sciences → Computer Science → Computer Networks and Communications

Network Packet Processing and Optimization

Physical Sciences → Computer Science → Hardware and Architecture

A Novel Self-Supervised Framework Based on Masked Autoencoder for Traffic Classification

Abstract

Metrics

Citation History

Topics

Related Documents

Self-Supervised Learning Malware Traffic Classification Based on Masked Autoencoder

Medical image classification using self-supervised learning-based masked autoencoder

Masked autoencoder-based self-supervised learning for forest plant classification

Masked autoencoder-based self-supervised learning for forest plant classification

Masked autoencoder-based self-supervised learning for forest plant classification