Computer Engineering and Applications ›› 2015, Vol. 51 ›› Issue (19): 88-92.

Previous Articles     Next Articles

Identifying P2P flow in cloud computing

SHAN Kai, GAO Zhonghe, LI Fengyin   

  1. School of Computer Science College, Qufu Normal University, Rizhao, Shandong 276826, China
  • Online:2015-09-30 Published:2015-10-13

云计算环境下的P2P流量识别

单  凯,高仲合,李凤银   

  1. 曲阜师范大学 计算机科学学院,山东 日照 276826

Abstract: Due to memory limitations, P2P traffic identification can only deal with small-scale data sets in a stand-alone environment. And all the attribute characteristics used in the P2P traffic identification based on Bayesian classification are artificial selected. Therefore, the recognition rate is both restricted and lack of objectivity. Based on the above analysis, a Naive Bayesian classification algorithm is proposed in the cloud computing environment, and then an attribute reduction algorithm is improved to adapt to the cloud computing environment. Finally, both above algorithms are combined to achieve fine-grained encrypted P2P traffic identification. The experimental results show that this method can efficiently process large data sets of network traffic, and the recognition rate of P2P flow is high, and the results are objective at the same time.

Key words: P2P traffic identication, cloud computing, rough set, Naive Bayesian

摘要: 由于内存限制使得单机环境下的P2P流量识别方法只能对小规模数据集进行处理,并且基于朴素贝叶斯分类的识别方法所使用的属性特征均为人工选择,因此,识别率受到了限制并且缺乏客观性。基于以上问题分析提出了云计算环境下的朴素贝叶斯分类算法并改进了在云计算环境下属性约简算法,结合这两个算法实现了对加密P2P流量的细粒度识别。实验结果表明该方法可以高效处理大数据集网络流量,并且有很高的P2P流量识别率,同时结果也具备客观性。

关键词: P2P流量识别, 云计算, 粗糙集, 朴素贝叶斯