Research on the System of Data Mining Based on Hadoop

Xin Yu Zhen; Yong Xia

doi:10.4028/www.scientific.net/AMM.687-691.1157



Abstract:

Hadoop is becoming a necessary part of large-scale data mining systems. This paper therefore examines the practice of running data mining tasks on a Hadoop distributed system. The main task is to build a distributed cluster computing environment with Hadoop and to implement a data mining task in that environment. We select data clustering as the representative task and study the K-means clustering algorithm in depth.
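The abstract does not show how K-means is mapped onto Hadoop, so the following is only a minimal sketch of one common way to phrase the algorithm as map and reduce steps, written in plain Python rather than against the Hadoop API. The function names (`map_phase`, `reduce_phase`, `kmeans`), the tolerance, and the iteration cap are illustrative assumptions, not the authors' implementation.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Point = Tuple[float, ...]


def squared_distance(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def map_phase(points: Iterable[Point],
              centroids: List[Point]) -> Iterator[Tuple[int, Point]]:
    """Map step: emit (index of nearest centroid, point) for each input point."""
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: squared_distance(p, centroids[i]))
        yield nearest, p


def reduce_phase(pairs: Iterable[Tuple[int, Point]],
                 centroids: List[Point]) -> List[Point]:
    """Reduce step: average the points grouped under each centroid index.

    A centroid that received no points keeps its previous position
    (an assumption; other policies are possible).
    """
    groups: Dict[int, List[Point]] = defaultdict(list)
    for idx, p in pairs:
        groups[idx].append(p)

    updated: List[Point] = []
    for i, old in enumerate(centroids):
        members = groups.get(i)
        if not members:
            updated.append(old)
            continue
        updated.append(tuple(
            sum(m[d] for m in members) / len(members) for d in range(len(old))
        ))
    return updated


def kmeans(points: List[Point], centroids: List[Point],
           max_iter: int = 20, tol: float = 1e-9) -> List[Point]:
    """Repeat map and reduce until the centroids stop moving or max_iter is hit."""
    for _ in range(max_iter):
        updated = reduce_phase(map_phase(points, centroids), centroids)
        shift = max(squared_distance(a, b) for a, b in zip(centroids, updated))
        centroids = updated
        if shift <= tol:
            break
    return centroids


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
    print(kmeans(data, [(0.0, 0.0), (10.0, 10.0)]))
```

On a real cluster, each iteration would typically be one MapReduce job, with the current centroids distributed to every mapper and the reducers writing the updated centroids for the next job. The single-process loop above only illustrates that data flow.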


Info:

Periodical:

Applied Mechanics and Materials (Volumes 687-691)

Pages:

1157-1160

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.687-691.1157


Online since:

November 2014

Authors:

Xin Yu Zhen*, Yong Xia

Keywords:

Data Mining (DM), K-Means


* - Corresponding Author
