Research on Hadoop Greedy Scheduler Based on the Fair

Article Preview

Abstract:

Job scheduling technology is one of the Hadoop platform’s key technologies, whose main function is to control the execute sequence of job and the distribution of computing resources, which directly relates to the Hadoop platform’s overall performance and system resources, usage. However, the existing job scheduling algorithms such as FIFO Scheduler, Fair Scheduler and Capacity Scheduler all have some defects. To overcome theses defects, this paper proposed a new algorithm Hadoop Greedy Scheduler Based on the Fair (HGSF). Firstly, the job pools are sorted by priority from high to low; pools in the same priority are sorted by their minimum requirements. Then their minimum requirements are met in turn. If have spare computing resources, it will be assigned to the job pool which has the highest priority and the minimum difference between the ideal requirement and the minimum requirement. Finally, the algorithm uses delay allocation strategy to improve the localization of the data for computing tasks. The experimental results testified the effectiveness of the proposed algorithm.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

460-464

Citation:

Online since:

December 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Information on http: /svn. apache. org/repos/asf/hadoop/mapreduce/trunk/src/contrib/fairscheduler/designdoc/fair_scheduler_design_doc. pdf.

Google Scholar

[2] Information on http: /hadoop. apache. org/common/docs/r0. 19. 1/capacity_scheduler. html.

Google Scholar

[3] Matei Zaharia, Dhruba Borthakur, Joydeep Sen Sarma, Khaled Elmelegy, Scott Shenker, and Ion Stoica. Technical Report UCB/EECS-2009-55, EECS Department, University of California, Berkeley (2009).

Google Scholar

[4] Matei Zaharia, Dhruba Borthakur, Joydeep Sen Sarma, Khaled Elmeleegy, Scott Shenker, Ion Stoica, in: Proceedings of the 5th European conference on Computer systems, Paris, France (2010).

DOI: 10.1145/1755913.1755940

Google Scholar