p.1794
p.1798
p.1802
p.1806
p.1810
p.1814
p.1821
p.1825
p.1829
Static Fault-Tolerant Strategy for High Performance Computing Platform
Abstract:
It is an important research issue to ensure the computation correctness for parallel application and enhance the using rate of dynamic computing resource in distributed computing system. Based on the previous high performance distributing computing system, a fault-tolerant and task scheduler was developed, which combined the breathe mechanism, fault-discover mechanism and subtask reschedule mechanism. Experiments show that the fault-tolerant and task-scheduler has good performance and ensures the computation correctness even if when some computing resources fail.
Info:
Periodical:
Pages:
1810-1813
Citation:
Online since:
July 2014
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: