Study of Map-Reduce over Hadoop Based Cloud Computing Environment

Article Preview

Abstract:

Popularity for the term Cloud-Computing has been increasing in recent years. In addition to the SQL technique, Map-Reduce, a programming model that realizes implementing large-scale data processing, has been a hot topic that is widely discussed through many studies. Many real-world tasks such as data processing for search engines can be parallel-implemented through a simple interface with two functions called Map and Reduce. We focus on comparing the performance of the Hadoop implementation of Map-Reduce with SQL Server through simulations. Hadoop can complete the same query faster than SQL Server. On the other hand, some concerned factors are also tested to see whether they would affect the performance for Hadoop or not. In fact more machines included for data processing can make Hadoop achieve a better performance, especially for a large-scale data set.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

175-181

Citation:

Online since:

February 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] I. Shadi, J. Hai, L. Lu, L. Qi, S. Wu, and X. -H. Shi, Evaluating MapReduce on Virtual Machines: The Hadoop Case, in Proceedings of the 1st International Conference on Cloud Computing Beijing, China: Springer-Verlag, (2009).

DOI: 10.1007/978-3-642-10665-1_47

Google Scholar

[2] L. -Q. Li, An optimistic differentiated service job scheduling system for Cloud Computing service users and providers, Qingdao, China, 2009, pp.295-299.

DOI: 10.1109/mue.2009.58

Google Scholar

[3] D. Jeffrey and G. Sanjay, MapReduce: simplified data processing on large clusters, Commun. ACM, vol. 51, pp.107-113, (2008).

DOI: 10.1145/1327452.1327492

Google Scholar

[4] J. L. Johnson, SQL in the Clouds, Computing in Science & Engineering, vol. 11, pp.12-28, (2009).

Google Scholar

[5] Y. -C. Tsay, Application of Java on Statistics Education, Department of Applied Mathematics, National Sun Yat-Sen University, Kaohsiung, Taiwan, July (2000).

DOI: 10.3934/jimo.2009.5.161

Google Scholar

[6] G. Mackey, S. Sehrish, J. Bent, J. Lopez, S. Habib, and J. Wang, Introducing map-reduce to high end computing, " in Petascale Data Storage Workshop, 2008. PDSW , 08. 3rd, 2008, pp.1-6.

DOI: 10.1109/pdsw.2008.4811889

Google Scholar

[7] SQL Server Developer Center, http: /msdn. microsoft. com/en-us/sqlserver/default. aspx.

Google Scholar

[8] Apache, Welcome to Hadoop!, http: /hadoop. apache. org/, (2009).

Google Scholar