Study on Fault Tolerance for Virtualization-Based Computer Simulation Systems

Abstract:

Modern computer simulation systems have evolved toward large-scale, distributed computing. Large-scale simulation applications are typically deployed over heterogeneous networks across geographically dispersed locations, and a simulation run often lasts a long time without interruption. The challenge is that errors are unavoidable during long continuous runs in such a broad network environment with a huge number of simulation resources. Fault tolerance for simulation has therefore become an active research topic. This paper introduces a live migration method into virtualization-based computer simulation systems to address reliability problems, especially fault tolerance. The paper first presents a framework for simulation fault tolerance and then discusses the detailed mechanism for live migration of a running simulation. The method offers an approach to making distributed, long-running simulation applications more reliable.

Info:

Periodical:

Advanced Materials Research (Volumes 201-203)

Pages:

677-680

Online since:

February 2011

Copyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved
