Internet Traffic Classification Using C4.5 Decision Tree

Fund Project: Supported by the National Basic Research Program of China (973 Program) under Grant No. 2007CB307100

Abstract: In recent years, applying machine learning to Internet traffic classification has become a new research direction in network measurement. In existing work, Naïve Bayes and its improved variants have been widely used because they are simple to implement and classify efficiently. However, these methods depend too heavily on how the samples are distributed in the sample space, which makes them potentially unstable. To address this problem, this paper introduces a method based on the C4.5 decision tree. The method builds a classification model from the information entropy of the training data set, and classifies an unknown network flow by a simple lookup in that model. Both theoretical analysis and experimental results show that the C4.5 decision tree has a clear advantage in classification stability when applied to Internet traffic classification.
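The entropy-based model building mentioned in the abstract can be illustrated with a minimal sketch. It assumes discrete attributes and uses the gain ratio criterion commonly associated with C4.5. The function names `entropy` and `gain_ratio`, the attribute values, and the class labels are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def gain_ratio(values, labels):
    """Gain ratio of one discrete attribute: information gain / split information."""
    total = len(labels)
    # Group the class labels by attribute value.
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    # Expected entropy of the class labels after splitting on the attribute.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    gain = entropy(labels) - remainder
    # Split information is the entropy of the attribute values themselves.
    split_info = entropy(values)
    return gain / split_info if split_info > 0 else 0.0


# Hypothetical flow samples: one coarse attribute per flow and an application label.
port_class = ["low", "low", "high", "high"]
app_label = ["WWW", "WWW", "P2P", "P2P"]
print(gain_ratio(port_class, app_label))
```

A tree builder would evaluate each candidate attribute this way at every node, split on the attribute with the highest score, and recurse on the resulting subsets. Classifying an unknown flow then amounts to following one path from the root to a leaf.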