JOURNAL ARTICLE

Distributionally Robust Federated Learning for Network Traffic Classification With Noisy Labels

Siping ShiYingya GuoDan WangYifei ZhuZhu Han

Year: 2023 Journal:   IEEE Transactions on Mobile Computing Vol: 23 (5)Pages: 6212-6226   Publisher: IEEE Computer Society

Abstract

Network traffic classifiers of mobile devices are widely learned with federated learning(FL) for privacy preservation. Noisy labels commonly occur in each device and deteriorate the accuracy of the learned network traffic classifier. Existing noise elimination approaches attempt to solve this by detecting and removing noisy labeled data before training. However, they may lead to poor performance of the learned classifier, as the remaining traffic data in each device is few after noise removal. Motivated by the observation that the data feature of the noisy labeled traffic data is clean and the underlying true distribution of the noisy labeled data is statistically close to the clean traffic data, we propose to utilize the noisy labeled data by normalizing it to be close to the clean traffic data distribution. Specifically, we first formulate a distributionally robust federated network traffic classifier learning problem (DR-NTC) to jointly take the normalized traffic data and clean data into training. Then we specify the normalization function under Wasserstein distance to transform the noisy labeled traffic data into a certified robust region around the clean data distribution, and we reformulate the DR-NTC problem into an equivalent DR-NTC-W problem. Finally, we design a robust federated network traffic classifier learning algorithm, RFNTC, to solve the DR-NTC-W problem. Theoretical analysis shows the robustness guarantee of RFNTC. We evaluate the algorithm by training classifiers on a real-world dataset. Our experimental results show that RFNTC significantly improves the accuracy of the learned classifier by up to 1.05 times.

Keywords:
Computer science Classifier (UML) Traffic classification Robustness (evolution) Noisy data Data mining Artificial intelligence Training set Machine learning Pattern recognition (psychology) Computer network Quality of service

Metrics

12
Cited By
3.07
FWCI (Field Weighted Citation Impact)
44
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Internet Traffic Analysis and Secure E-voting
Physical Sciences →  Computer Science →  Artificial Intelligence
Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Network Security and Intrusion Detection
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Communication-Efficient Robust Federated Learning with Noisy Labels

Junyi LiJian PeiHeng Huang

Journal:   Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining Year: 2022 Pages: 914-924
BOOK-CHAPTER

Federated Learning with Local Openset Noisy Labels

Zonglin DiZhaowei ZhuXiaoxiao LiYang Liu

Lecture notes in computer science Year: 2024 Pages: 38-56
© 2026 ScienceGate Book Chapters — All rights reserved.