Study on the Optimization of Data Mining in Big Data

Article Preview

Abstract:

This paper proposes an analysis measure for Big Data by optimizing traditional data mining, base on Weka data analyzing platform ,K-means algorithm is employed through the interface programs in Weka system, so that optimized data mining techniques can be applied in cloud storage, cloud computing of Big Data by clustering analysis base on Big Data pre-processing and real-time monitoring of memory.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 989-994)

Pages:

1837-1840

Citation:

Online since:

July 2014

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Hui Yan, Haiyan Hu. Design and Realization of Innovation and entrepreneurship Service Platform for Undergraduates Based on Big Data[J]. ITMI2013. 2013. 7: 394~397.

DOI: 10.4028/www.scientific.net/amm.411-414.394

Google Scholar

[2] Xiaojun Chen, Xiaofei Xu, Yunming Ye, etc. TW - k - means: automated two - level variable weighting clustering algorithm for multi - view data[J]. IEEE Transactions on Knowledge and Data Engineering,http: / /doi. ieeecomputersociety. org /10. 1109 /TKDE. 2011. 262.

DOI: 10.1109/tkde.2011.262

Google Scholar

[3] Xiaojun Chen, Yunming Ye, Xiaofei Xu etc. A feature group weighting method for subspace clustering of high - dimensional data[J]. Pattern Recognition, 2012, 45( 1) : 434-446.

DOI: 10.1016/j.patcog.2011.06.004

Google Scholar

[4] Bingguo Li, Xiaojun Chen, Mark Junjie Li, etc. Scalable random forests for massive data[C]. PAKDD, (2012).

Google Scholar

[5] Baoxun Xu, Joshua Zhexue Huang, Graham Willams, etc. Classifying very high - dimensional data with random forests built from small subspaces[J], International Journal of Data Warehouse and Mining, 2012, 8( 2) : 45-62.

DOI: 10.4018/jdwm.2012040103

Google Scholar