A New Iterative Modularity-Based Method for Graph Clustering on Scalable Networks

Article Preview

Abstract:

Due to the advancement of technology, modern networks such as social networks, citation networks, Web networks have been extremely large, reaching millions of nodes in a network. But most of the existing graph clustering algorithms can only tackle with small or medium-size networks. In this paper, we introduce a new method which can achieve high graph clustering quality for large scale networks by optimizing the modularity function. It is based on the iterative idea and takes good advantage of the exsiting multilevel local search heuristics. After introducing this modularity-based method, we evaluate its performance by applying it to several well-known network datasets. With a cost of more but acceptable time, it outperforms the best algorithms in the literature in the case of modularity optimization quality.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2562-2567

Citation:

Online since:

September 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Schaeffer S E: Computer Science Review, Vol. 1 (2007) No. 1, p.27.

Google Scholar

[2] Newman M E J: SIAM review, Vol. 45(2003) No. 2, p.167.

Google Scholar

[3] Kim S, Shi T: arXiv preprint arXiv, Vol. 1211(2012) No. 6807.

Google Scholar

[4] Von Luxburg U: Statistics and computing, Vol. 17(2007) No. 4, p.395.

Google Scholar

[5] Newman M E J: arXiv, Vol. 1307(2013) No. 7729.

Google Scholar

[6] Newman M E J, Girvan M: Physical review E, 2004, Vol. 69(2004) No. 2.

Google Scholar

[7] Brandes U, Delling D, Gaertler M, et al: Knowledge and Data Engineering, IEEE Transactions on, Vol. 20(2008) No. 2, p.172.

Google Scholar

[8] Newman M E J: Physical review E, Vol. 69(2004) No. 6.

Google Scholar

[9] Ovelgönne M, Geyer-Schulz A: Challenges at the Interface of Data Analysis, Computer Science, and Optimization (Springer Berlin Heidelberg, Germany 2012), p.225.

Google Scholar

[10] Ng A Y, Jordan M I, Weiss Y: Advances in neural information processing systems, Vol. 2(2002), p.849.

Google Scholar

[11] Dhillon I S, Guan Y, Kulis B. Kernel: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, 2004), p.551.

DOI: 10.1145/1014052.1014118

Google Scholar

[12] Reichardt J, Bornholdt S: Physical Review E, Vol. 74(2006) No. 1.

Google Scholar

[13] Lukashin A V, Fuchs R: Bioinformatics, Vol. 17(2001) No. 5, p.405.

Google Scholar

[14] Ovelgönne M, Geyer-Schulz A: Graph Partitioning and Graph Clustering, Vol. 588(2012), p.187.

DOI: 10.1090/conm/588/11701

Google Scholar

[15] Rotta R, Noack A: Journal of Experimental Algorithmics (JEA), 2011, Vol. 16(2011) No. 2, p.3.

Google Scholar

[16] Newman M E J: Proceedings of the National Academy of Sciences, Vol. 103(2006) No. 23, p.8577.

Google Scholar

[17] Blondel V D, Guillaume J L, Lambiotte R, et al: Journal of Statistical Mechanics: Theory and Experiment, Vol. 2008(2008) No. 10, p.10008.

Google Scholar

[18] Information on http: /dblp. uni-trier. de.

Google Scholar

[19] Leskovec J, Lang K J, Dasgupta A, et al: Internet Mathematics, Vol. 6(2009) No. 1), p.29.

Google Scholar

[20] Leskovec J, Huttenlocher D, Kleinberg J: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (ACM, 2010), p.1361.

Google Scholar

[21] Leskovec J, Huttenlocher D, Kleinberg J: Proceedings of the 19th International Conference on World Wide Web (ACM, 2010), p.641.

Google Scholar

[22] Backstrom L, Huttenlocher D, Kleinberg J, et al: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, 2006), p.44.

DOI: 10.1145/1150402.1150412

Google Scholar

[23] Leskovec J, Lang K J, Dasgupta A, et al: Internet Mathematics, 2009, Vol. 6(2009) No. 1, p.29.

Google Scholar

[24] Information on http: /wiki. dbpedia. org/Datasets.

Google Scholar

[25] Medelyan O, Legg C: Proceedings of the WIKI-AI: Wikipedia and AI Workshop at the AAAI08 Conference (Chicago, 2008).

Google Scholar

[26] Pohl A: Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference (2012).

Google Scholar

[27] Information on http: /www. w3. org/TR/rdf-schema.

Google Scholar

[28] Giannini S. Business Information Systems Workshops (Springer Berlin Heidelberg, Germany 2013), p.220.

Google Scholar

[29] Yang J, Leskovec J. Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics (ACM, 2012), Vol. 3.

Google Scholar

[30] Information on http: /hadoop. apache. org.

Google Scholar