A Micro-Cluster-Based Data Stream Clustering Method for P2P Traffic Classification

Article Preview

Abstract:

Many machine learning techniques were proposed to classify P2P traffic and each with reasonable successes. But in the real P2P network environment, new communities of peers often attend and old communities of peers often leave. It requires the identification methods to be capable of coping with concept drift and updating the model incrementally. In this paper, we presented a concept-adapting algorithm MCStream which was based on streaming data mining techniques to identify P2P applications in Internet traffic. The MCStream used two micro-cluster structures, potential micro-cluster structures and outlier micro-cluster structures, to classify the P2P traffic and discovered the concept drift with limited memory. Our performance studied over a number of real data which was captured at a main gateway router demonstrates the effectiveness and efficiency of our method.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1121-1126

Citation:

Online since:

December 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] S.Saroiu, K. P. Gummadi, R. J. Dunn, S.D. Gribble, and H. M. Levy, "An analysis of Internet content delivery systems", in Proceeding of the 5th symposium on Operating systems design and implementation, 2002, pp.315-327. [2] M.A. Green: High Efficiency Silicon Solar Cells (Trans Tech Publications, Switzerland 1987).

DOI: 10.1145/1060289.1060319

Google Scholar

[2] T. Karagiannis, A. Broido, M. Faloutsos, and K. claffy, "Transport Layer Identification of P2P Traffic". In IMC'04, Taormina, Italy, October 25-27, 2004. [4] G. Henkelman, G.Johannesson and H. Jónsson, in: Theoretical Methods in Condencsed Phase Chemistry, edited by S.D. Schwartz, volume 5 of Progress in Theoretical Chemistry and Physics, chapter, 10, Kluwer Academic Publishers (2000).

DOI: 10.1145/1028788.1028804

Google Scholar

[3] S.Sen, O.Spatscheck, and D.Wang, "Accurate, Scalable In-Network Identification of P2P Traffic Using Application Signatures", in WWW2005, New York, USA, May 17-22, 2004. [6] P.G. Clem, M. Rodriguez, J.A. Voigt and C.S. Ashley, U.S. Patent 6,231,666. (2001)

DOI: 10.1145/988672.988742

Google Scholar

[4] Spognardi, A. Lucarelli, and R. D. Pietro, "A methodology for P2P file-sharing traffic detection", in Proceedings of the Second International Workshop on Hot Topics in Peer-to-Peer Systems – Volume 00 HOT-P2P'05, 2005, pp.52-61.

DOI: 10.1109/hot-p2p.2005.2

Google Scholar

[5] H. Bleul, E. P. Rathgeb, and S. Zilling, "Evaluation of an efficient measurement concept for P2P multiprotocol traffic analysis", in Proceedings of the 32nd EUROMICRO Conference on Software Engineering and Advanced Application, 2006, pp.414-423.

DOI: 10.1109/euromicro.2006.35

Google Scholar

[6] A. W. Moore, D. Zuev, "Internet traffic classification using Bayesian analysis techniques", ACM SIGMETRICS Performance Evaluation Review, 2005, 33(1):50-60.

DOI: 10.1145/1071690.1064220

Google Scholar

[7] A. W. Moore, D. Zuev, Discrimination for use in flow-based classification, Technical report, Intel Research, Cambridge, 2005.

Google Scholar

[8] F. Hernandez, A. B. Nobel, F. D. Smith, and K. Jeffay. "Statistical Clustering of Internet Communication Patterns", in Proceedings of Symposium on the Interface of Computing Science and Statistics, 2003.

Google Scholar

[9] Watanabe O. "Simple Sampling Techniques for Discovery Science", IEICE Transactions on Information and Systems, 2000, E83-D (1): 19-26

Google Scholar

[10] B. Raahemi, W Zhong, J Liu. "Peer-to-Peer Traffic Identification by Mining IP Layer Data Streams Using Concept-Adapting Very Fast Decision Tree", 20th IEEE International Conference on Tools with Artificial Intelligence, 2008.

DOI: 10.1109/ictai.2008.12

Google Scholar

[11] F. Cao, M. Ester, W. Qian, and A. Zhou, "Density-based clustering over an evolving data stream with noise", in SIAM Conference on Data Mining (SDM), pages 326-337, 2006.

DOI: 10.1137/1.9781611972764.29

Google Scholar