首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Research on internet traffic classification techniques using supervised machine learning
Authors:Li Jun  Zhang Shunyi  Wang Pan  Li Cuilian
Institution:1. Information Networking Institute,Nanjing University of Posts and Telecommunications,Nanjing 210003,P.R.China;Department of Telecommunication Engineering,Zhejiang Wanli University,Ningbo 315100,P.R.Chi
2. Information Networking Institute,Nanjing University of Posts and Telecommunications,Nanjing 210003,P.R.China
3. Department of Telecommunication Engineering,Zhejiang Wanli University,Ningbo 315100,P.R.China
Abstract:Internet traffic classification is vital to the areas of network operation and management. Traditional classification methods such as port mapping and payload analysis are becoming increasingly difficult as newly emerged applications (e.g. Peer-to-Peer) using dynamic port numbers, masquerading techniques and encryption to avoid detection. This paper presents a machine learning (ML) based traffic classification scheme, which offers solutions to a variety of network activities and provides a platform of performance evaluation for the classifiers. The impact of dataset size, feature selection, number of application types and ML algorithm selection on classification performance is analyzed and demonstrated by the following experiments: (1) The genetic algorithm based feature selection can dramatically reduce the cost without diminishing classification accuracy. (2) The chosen ML algorithms can achieve high classification accuracy. Particularly, REPTree and C45 outperform the other ML algorithms when computational complexity and accuracy are both taken into account. (3) Larger dataset and fewer application types would result in better classification accuracy. Finally, early detection with only several initial packets is proposed for real-time network activity and it is proved to be feasible according to the preliminary results.
Keywords:supervised machine learning  traffic classification  feature selection  genetic algorithm (GA)
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号